Methods and apparatus for detecting remote control of a client device

ABSTRACT

A system, method, and apparatus for detecting remote control of a client device are disclosed. An example network security apparatus includes a network switch configured to route first data packets between a client device and a content provider device, determine IP addresses of other devices that transmit second data packets to or receive second data packets from the client device, and throttle the second data packets destined for the client device. The apparatus also includes a controller configured to receive signal packets indicative of activity in relation to a webpage provided by the content provider device to the client device and instruct the network switch to throttle the second data packets after receiving one of the signal packets. The controller is also configured to provide an indication of a malicious device remotely controlling the client device responsive to not receiving another signal packet within a specified time period.

PRIORITY CLAIM

The present application claims priority to and the benefit of U.S. Provisional Patent Application No. 62/161,530 filed on May 14, 2015, the entirety of which is incorporated herein by reference.

BACKGROUND

Traditional communication systems address certain reliability and performance issues that arise during the transfer of information from a sender to a receiver through a medium. In an idealized situation, no errors are introduced as the information travels through the medium. As a result, the receiver obtains, with 100% fidelity, a message identical to the one transmitted into the medium by the sender.

In practice, however, the medium is not error free. Environmental factors typically contribute haphazard information to the medium. This haphazard information is commonly referred to as “noise”. This noise can result from, for example, shot noise, neighboring radio frequencies, undesirable voltage and/or current fluctuations in circuit components, signal reflections from trees/buildings, solar flares, etc.

In information warfare, there exists a related concept of signal jamming. The idea is to increase the contribution of the noise to such an extent that it becomes practically impossible to find a set of codewords that are simultaneously robust and efficient. This type of noise is not haphazard but rather specifically crafted to render a specific medium too noisy to use. The targets of this type of purposefully crafted noise are unable to communicate.

An important purpose of traditional communication systems is to characterize a noise source and to create a set of primary codewords that are robust against that noise type. The primary codewords are designed to be efficient for communication of a wide variety of often used messages. As provided by traditional communication systems, the transmission of information through the Internet occurs over a variety of media including cable, wireless, satellite, etc. Currently, traditional communication systems play a significant role in engineering and assuring the reliability and efficiency of those transmissions against a variety of haphazard noise sources.

Traditional communication systems have reduced the effects of haphazard noise in the communication medium as well as at the sender and the receiver. For example, the sender or the receiver can include circuitry to reduce or eliminate the effects of haphazard noise. Additionally, routing devices in the medium, the sender, and the receiver can also use quality of service, data integrity, and/or error correction functions to correct for haphazard noise. These functions can be associated with, for example, network cards and associated stacks as received packets are queued and recombined into a complete data stream.

In addition to haphazard noise, there also exists engineered malicious noise specifically created to affect, alter, or otherwise interfere with communications between a sender and a receiver. This malicious noise is an injected signal that alters codewords sent between senders and receivers in a manner that is generally not correctable by existing error correction methods of traditional communication systems. The malicious noise, created by malicious applications, is directed to interfere with communications anywhere along a communication channel through the Internet from a sender to a receiver including routers, switches, repeaters, firewalls, etc.

The malicious applications are configured to identify codeword sets and provide malicious noise that effectively switches one valid codeword for a second valid codeword. Traditional error correction schemes cannot detect this switch because they have no way of identifying that an error has occurred. The resulting altered signal is viewed as a valid codeword from the point of view of the traditional communication system. Other types of noise that commonly occur in information warfare are also deliberate and engineered (e.g., signal jamming) but the phenomenon does not result in a usable codeword set.

Unlike environmentally derived haphazard noise, this malicious noise does not consist of haphazard content, nor does it disallow effective communication as a jamming signal might. Instead, this noise is specifically crafted to substitute the originally transmitted message for a second, specific, legitimate, and understandable message which is then presented to a receiver as authentic intent of the sender. The crafted noise may also occur before selected information leaves a sender (e.g., a server, database and/or directory structure) for transmission to a receiver. This crafted noise is referred to herein as malicious noise. The crafter of the malicious noise is referred to herein as a malicious application.

Using malicious noise, viruses and other types of malicious applications are able to direct a client device (e.g., a receiver) to perform actions that a communicatively coupled server (e.g., a sender) did not originally intend. Additionally, the viruses and malicious applications are able to direct a server to perform actions that communicatively coupled client devices did not originally intend. Conventional virus detection algorithms often fail to detect the malicious nature of the noise because these algorithms are configured to detect the presence of the noise's source rather than the noise itself. The noise generation algorithm (e.g., the code of the malicious application) is relatively easily disguised and able to assume a wide variety of formats. There is accordingly a need to validate communications between servers and client devices in the presence of malicious noise.

SUMMARY

The present disclosure provides a new and innovative system, method, and apparatus for detecting remote control of a client device. Typically, malicious command and control systems are configured to obtain control of a client device to take advantage of the fact that the client device may be designated as a trusted device by some third-party servers. Communication between the client device and a designated third-party server is made within a session. In contrast, remote control of the client device by a malicious command and control system is made outside the session. The example system, method, and apparatus disclosed herein are configured to detect malicious command and control systems by monitoring in-session communications between a client device and a designated third-party server while changing or altering the flow of communications to other third-party servers or devices that are outside of the in-session communications. The example system, method, and apparatus determine that a malicious application is controlling the client device by detecting changes to the in-session communications as a result of the altered flow of communications outside of the session.

In an example embodiment, a network security apparatus includes a network switch configured to route first data packets between a client device and a content provider device and designate the client device as a protected device conditioned on determining the content provider device accessed by the client device belongs to a pre-specified set of endpoints. The network switch is also configured to determine at least one of Internet Protocol (“IP”) addresses and ports of other devices that transmit second data packets to or receive second data packets from the protected device and throttle the second data packets destined for the protected device. The example apparatus also includes a controller communicatively coupled to the network switch and configured to receive a first signal packet from the protected device via the network switch indicative that a webpage provided by the content provider device to the client device is to be rendered and receive a second signal packet from the protected device via the network switch indicative that a username is being typed into a username field of the webpage. The controller is also configured to instruct the network switch to throttle the second data packets after receiving the second signal packet and receive a third signal packet from the protected device via the network switch indicative that a password is being typed into a password field of the webpage. The controller is further configured to instruct the network switch to resume normal routing of the second data packets conditioned on the third signal packet being received within a specified time period beginning when the second signal packet was received. Responsive to not receiving the third signal packet within the specified time period, the example controller is configured to provide an indication of a malicious device remotely controlling the protected device.
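The timing logic of the controller in this embodiment can be sketched as a small state machine. The following is a minimal illustration only, not the disclosed implementation: the class name, the signal labels ("page_rendering", "username_typing", "password_typing"), the use of seconds, and the periodic on_tick call are all assumptions introduced here for clarity.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ControllerSketch:
    """Tracks one protected device's login activity (hypothetical names)."""

    time_limit_s: float                      # the "specified time period" (assumed unit: seconds)
    throttling: bool = False                 # whether second data packets are being throttled
    username_seen_at: Optional[float] = None
    alert: bool = False                      # indication of possible remote control

    def on_signal(self, kind: str, now: float) -> None:
        if kind == "username_typing":
            # Second signal packet: instruct the switch to throttle and start the timer.
            self.throttling = True
            self.username_seen_at = now
        elif kind == "password_typing" and self.username_seen_at is not None:
            if now - self.username_seen_at <= self.time_limit_s:
                # Third signal packet arrived in time: resume normal routing.
                self.throttling = False
                self.username_seen_at = None
            else:
                self.alert = True
        # "page_rendering" (first signal packet) needs no action in this sketch.

    def on_tick(self, now: float) -> None:
        # Called periodically; raises the indication once the window has expired.
        if (self.username_seen_at is not None
                and now - self.username_seen_at > self.time_limit_s):
            self.alert = True


# Usage: typing continues normally while out-of-session traffic is throttled.
controller = ControllerSketch(time_limit_s=5.0)
controller.on_signal("page_rendering", now=0.0)
controller.on_signal("username_typing", now=1.0)
controller.on_signal("password_typing", now=3.0)
```

In this sketch, a device whose in-session typing stalls while out-of-session traffic is throttled never delivers the third signal in time, so on_tick raises the indication and throttling stays in place.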

In another example embodiment, a method includes detecting within a web browsing application operating on a client device that a webpage from a content provider server is being rendered. The method also includes transmitting a first signal packet to a controller via a network switch indicative that the webpage is in the process of being rendered and detecting a user of the client device has begun to type first information into a first field of the webpage. The method further includes transmitting a second signal packet to the controller indicative that the user has begun to type the first information and detecting the user of the client device has begun to type second information into a second field of the webpage. Moreover, the method includes transmitting a third signal packet to the controller indicative that the user has begun to type the second information and responsive to receiving a protection message from the controller, instructing the web browsing application to suspend operation.

In yet another example embodiment, a system for detecting a malicious device remotely controlling a client device includes a security application operating embedded within a website operated by a content provider server, the website being displayed by a web browsing application on the client device. In this embodiment, the security application is configured to transmit signaling packets indicative of in-session activity of the web browsing application with respect to the website. The system also includes a controller configured to use the signaling packets to delay data packets between the client device and other servers and use the signaling packets to determine if the in-session activity is affected by the delay of the data packets. Responsive to determining the in-session activity is affected, the controller is configured to provide an indication of the malicious device remotely controlling the client device. The system further includes a network switch communicatively coupled to the controller, the client device, the other devices including the malicious device, and the content provider server, the network switch configured to route the data packets and delay routing of the data packets.

Additional features and advantages of the disclosed system, method, and apparatus are described in, and will be apparent from, the following Detailed Description and the Figures.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 is a block diagram of an example network communicating system, according to an example embodiment of the present disclosure.

FIG. 2 is a detailed block diagram showing an example of a client device, application server, or database server according to an example embodiment of the present invention.

FIG. 3 is a diagram of a communication session between a client device and an application server.

FIG. 4 shows a diagram of backscattered channel information during the communication session between the server and the client device using the communication channel of FIG. 3.

FIGS. 5 and 6 show diagrams representative of a malicious application affecting the communication session of FIG. 3 between the client device and server.

FIGS. 7 to 9 show datagrams created by a security processor to identify malicious applications.

FIG. 10 shows a diagram of a datagram that includes an example of coded soft information selected by a security processor.

FIG. 11 shows a client device including rendered information and un-rendered information as a variation of soft messaging.

FIG. 12 shows a security processor validating a transaction between a client device and a server.

FIG. 13 shows an illustration of two different configurations of a client device that can be accounted for by a security processor to create multiple predictions.

FIG. 14 shows a diagram of a data structure 1400 of a prediction formed by a security processor based on soft information and secondary information acquired from global and local observable temporal channel information.

FIGS. 15 and 16 illustrate a flow diagram showing example procedures to validate a communication channel, according to an example embodiment of the present invention.

FIG. 17 shows diagrams comparing messaging without using an example embodiment of the present invention and messaging using an example embodiment of the present invention.

FIGS. 18 to 21 show diagrams of a network communication system and related procedure configured to detect a malicious peer-to-peer distributed command and control system, according to example embodiments of the present disclosure.

DETAILED DESCRIPTION

The present disclosure relates in general to a method, apparatus, and system to validate communications in an open architecture system and, in particular, to predicting responses of client devices to identify malicious applications attempting to interfere with communications between servers and the client devices.

Briefly, in an example embodiment, a system is provided that detects malicious errors in a communication channel between a server and a client device. Normally, communication errors between a server and a client device are a result of random channel noise. For instance, communications received by server-client endpoints fall outside of a set of previously selected, recognizable messages or codewords. Channel errors are usually corrected by existing error correction schemes and internet protocols. The end user is typically unaware that a transmission error has occurred and has been corrected.

Malicious applications typically evade error correcting schemes in two ways: first by altering an original message into an alternative message, and second by creating noise in a segment of a channel where traditional error correction schemes do not operate. In the first way, a malicious application alters an original message into an alternative message that is already in a codeword set of an error correction mechanism. The malicious application may also provide additional messages that are included within the codeword set. As a result, an error correction algorithm is unaware that an error has even taken place and thereby makes no attempt to correct for the error.
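The first evasion route can be illustrated with a toy error-detecting code. The sketch below assumes a 4-bit even-parity code purely as a stand-in; the disclosure does not name a particular code, and the variable names are hypothetical. A single random bit flip leaves the codeword set and is caught, while a crafted two-bit change lands on another valid codeword and passes the check.

```python
def has_even_parity(word: int) -> bool:
    """Validity check of the toy code: a word is a codeword if it has an even number of 1 bits."""
    return bin(word).count("1") % 2 == 0


sent = 0b0011                      # a valid codeword (two 1 bits)

# Haphazard noise: one flipped bit produces a word outside the codeword set.
random_noise = sent ^ 0b0100       # 0b0111, odd parity

# Malicious noise: two flipped bits produce a different but still valid codeword.
malicious = sent ^ 0b0110          # 0b0101, even parity

detected_random = not has_even_parity(random_noise)
detected_malicious = not has_even_parity(malicious)
```

Here detected_random is true and detected_malicious is false, even though the malicious word differs from the one that was sent.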

In the second way, a malicious application creates noise in a segment of a channel where traditional error correction schemes do not operate. For example, once a packet successfully traverses the Internet and arrives at a network interface of a receiving device, a bit stream of the packet is processed by an application stack under an assumption that no further transmission noise sources will occur. As a result, the application stack does not anticipate errors to occur in the bit stream after processing and thereby makes no attempt to correct for any errors from this channel noise.

Malicious applications create targeted malicious noise configured to interfere with communications between a client device and a server. This channel noise is guided by a deliberate purpose of the malicious application to alter, access, or hijack data and/or content that is being communicated across a client-server connection. Oftentimes, the noise alters communications from original and authentic information to substitute authentic-appearing information. The noise is often induced in a segment of the (extended) channel that is poorly defended or entirely undefended by error correction algorithms. As a result, a malicious application is able to use channel noise to direct a server and/or a client device to perform actions that the client device or server did not originally intend.

In an example, a client device may be connected to an application server configured to facilitate banking transactions. During a transaction, the server requests the client device to provide authentication information (e.g., a username and a password) to access an account. A malicious application detects the connection and inserts malicious noise that causes the client device to display a security question in addition to the username and password prompts (e.g., client baiting). A user of the client, believing the server provided the security question, enters the answer to the security question with the username and password. The malicious application monitors the response from the client device so as to use malicious noise to remove the answer to the security question before the response reaches the server. The malicious application may then use the newly acquired answer to the security question to later illegally access the account associated with the client device to improperly withdraw funds.

In this example, the server is unable to detect the presence of the malicious application because the server receives a proper response to the authentication, namely the username and password. The client device also cannot detect the malicious application because the client device believes the server provided the security question. As a result, the malicious application is able to use channel noise to acquire sensitive information from the client device without being detected by the server or the client.

This client baiting is not the only method used by malicious applications. In other examples, malicious applications may use channel noise to add data transactions between a client device and a server (e.g., add banking transactions). For instance, a client device may specify three bill payment transactions and a malicious application may insert a fourth transaction. In further examples, malicious applications may use channel noise to remove, substitute, or acquire data transmitted between a server and a client, modify data flow between a server and a client, inject graphics or advertisements into webpages, add data fields to forms, or impersonate a client device or a server.

The example method, apparatus, and system disclosed herein overcome at least some of these issues caused by malicious noise by detecting malicious applications through estimated, predicted, or anticipated responses from a client device. The example method, apparatus, and system disclosed herein detect malicious applications by varying soft information describing how hard information is to be displayed by a client device. During any client-server connection, a server provides hard information and soft information. The hard information includes data, text, and other information that is important for carrying out a transaction with a client. The soft information specifies how the hard information is to be rendered and displayed by the client device.

A server uses hard and soft messaging to transmit the hard and soft information to a client device. In some instances, the soft and hard information can be combined into messages before transmission. In other examples, the soft and hard information can be transmitted to a client device in separate messages. As used herein, soft messaging refers to the transmission of soft information to a client device in separate or combined soft/hard messages and hard messaging refers to the transmission of hard information to a client device in separate or combined soft/hard messages.

The example method, apparatus, and system disclosed herein use variations in soft information to form a best guess (e.g., a prediction or estimation) as to how hard information is displayed by a client device. The example method, apparatus, and system disclosed herein then compare a response from the client device to the best guess. If the information included within the response does not match or is not close enough to the prediction, the example method, apparatus, and system disclosed herein determine that a malicious application is affecting communications between a server and a client or, alternatively, provide an indication that a malicious application is affecting communications. As a result of this detection, the example method, apparatus, and system disclosed herein implement fail safe procedures to reduce the effects of the malicious application.

The example method, apparatus, and system disclosed herein use soft information and messaging as a signaling language to detect malicious applications. In other words, the example method, apparatus, and system disclosed herein create an extended set of codewords for use with a user of a client device to validate that a malicious application is not interfering with communications. The created codeword set installs or uses soft messaging techniques including dynamically linked and/or static libraries, frameworks, browser helper objects, protocol filters, etc. The goal of these soft messaging techniques is to perturb the created communication channel such that the soft information cannot be reverse engineered by the malicious application but is known by the client device and the server.

For instance, FIG. 17 shows diagrams comparing messaging without the example method, apparatus, and system disclosed herein and messaging using the example method, apparatus, and system disclosed herein. Diagram 1700 shows that in the absence of the example method, apparatus, and system disclosed herein, a set of legitimate codewords (denoted by circles) is fixed. Malicious applications know how these codewords are fixed and use malicious noise (denoted by the arrow) to transform a first valid codeword into a second valid codeword. The transformation is undetected by a receiving client device and the sending server.

In contrast, diagram 1710 shows that the example method, apparatus, and system disclosed herein use variability in soft information and messaging to extend the dimensionality of the codeword set. This variability is unknown by the malicious application. Thus, an error occurs when the malicious noise combines with an intended codeword. As shown in diagram 1710, the resulting altered codeword (denoted by an “X”) does not match the set of anticipated recognized codewords, which enables the malicious noise to be detected. The example method, apparatus, and system disclosed herein are accordingly able to use this soft information and messaging variability to detect malicious noise.

As used herein, hard messaging and hard information is transactional text and/or data displayed by a client device. The transactional text, data, pictures, and/or images can be instructional, informational, functional, etc. in nature. The hard information also includes textual options that are selectable by a client. Hard information is accordingly principal information of a transaction or service provided by a server and presented to a client by a client device.

The hard information includes any type of text and/or data needed by a server to perform a transaction or service on behalf of a client. For instance, hard information of a webpage of an account log-in screen includes text providing instructions to a client as to the nature of the webpage, text for a username field, and text for a password field. After a client has logged into the account, the hard information includes transaction numbers, transaction dates, transaction details, an account balance, and account identifying information. Hard information may be financial (e.g. on-line banking), material (e.g., flow control of raw material in manufacturing processes), or related to data management (e.g., encryption, decryption, addition to or removal from shared storage, copying, deletion, etc.).

As used herein, soft messaging and soft information is presentation information describing how hard information is to be displayed by a client device. Soft information pertains to the installation and/or system usage of dynamically linked and/or static libraries, frameworks, browser helper objects, protocol filters, JavaScript, plug-ins, etc. that are used to display hard information without interrupting the communication of the hard portion of the message between a client device and a server. The soft portion of the message includes information based on a server's selection of protocol, formatting, positioning, encoding, presentation, and style of a fully rendered version of hard information to be displayed at the client device endpoint. The soft information can also include preferences (e.g., character sets, language, font size, etc.) of clients as to how hard information is to be displayed. The precise details of the manner or method in which the direct, client device initiated, response information returns to the server is also a soft component of the communication and may be varied or manipulated without detracting from an ability of the server and client device to conduct e-business, e-banking, etc.

The hard part of the message is constrained, for example, by business utility (e.g., there must be a mechanism for a client device to enter intended account and transaction information and return it to the server) while the soft part of the message has fewer constraints. For example, the order in which a client device enters an account number and a transaction amount usually is not important to the overall transaction. To achieve the business purpose a server only has to receive both pieces of information.
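The distinction between the constrained hard part and the loosely constrained soft part can be sketched as follows. This is an illustrative assumption only: the field names, the use of a shuffled presentation order as the soft variation, and the helper names are hypothetical and are not taken from the disclosure.

```python
import random

# Hard information: both fields must reach the server for the transaction to proceed.
HARD_FIELDS = ("account_number", "transaction_amount")


def choose_field_order(rng: random.Random) -> list:
    """Soft information: pick the order in which the fields are presented."""
    order = list(HARD_FIELDS)
    rng.shuffle(order)
    return order


def business_complete(response: dict) -> bool:
    """The business purpose is met when every hard field is present and non-empty,
    regardless of the order in which the client entered them."""
    return all(response.get(name) not in (None, "") for name in HARD_FIELDS)


# Usage: the presentation order varies per session, the hard requirement does not.
presentation_order = choose_field_order(random.Random())
ok = business_complete({"transaction_amount": "25.00", "account_number": "12345"})
```

Because the order is free to vary without affecting the transaction, it is the kind of degree of freedom that a server could change from session to session.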

In the client baiting example described above, the example method, apparatus, and system disclosed herein cause the server to transmit to the client device in one or more soft messages code that causes the client device to return coordinates of a mouse click of a ‘submit’ button. These soft messages are included with the other soft messages describing how the authentication information is to be displayed by the client. The server also determines a prediction as to what the coordinates should be based on knowing how the particular client device will render and display the information.

When the malicious application uses malicious noise to insert the security question, the malicious application has to move the ‘submit’ button lower on a webpage. Otherwise, the security question would appear out of place on the webpage in relation to the username and password fields. When a user of the client device uses a mouse to select the ‘submit’ button, the client device transmits the coordinates of the mouse click to the server. The server compares the received coordinates with the coordinates of the prediction and determines that the difference is greater than a standard deviation threshold, which indicates the presence of a malicious application. In response to detecting the malicious application, the server can initiate fail safe procedures to remedy the situation including, for example, requiring the client device to create new authentication information or restricting access to the account associated with the client device.
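The comparison between the reported click and the prediction can be sketched as a simple distance test. The sketch below is a minimal illustration under stated assumptions: the function name, the use of Euclidean distance in pixels, and the parameters sigma_px and max_sigmas are hypothetical; the disclosure states only that the difference is compared against a standard deviation threshold.

```python
import math


def click_matches_prediction(predicted, reported, sigma_px, max_sigmas=1.0):
    """Return True when the reported click lies within the allowed number of
    standard deviations (in pixels) of the predicted 'submit' button position."""
    distance = math.dist(predicted, reported)
    return distance <= max_sigmas * sigma_px


# Predicted position of the 'submit' button for this client's rendering (assumed values).
predicted_xy = (400.0, 300.0)

# A click close to the prediction is consistent with an unaltered page.
normal = click_matches_prediction(predicted_xy, (402.0, 301.0), sigma_px=5.0)

# A click 60 pixels lower, as when an inserted field pushes the button down, is not.
shifted = click_matches_prediction(predicted_xy, (400.0, 360.0), sigma_px=5.0)
```

A result of False for the shifted click would correspond to the case in which the server initiates its fail safe procedures.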

As can be appreciated from this example, the example method, apparatus, and system disclosed herein provide server-client communication channel validation. By knowing how a client device is to display information, the example method, apparatus, and system disclosed herein enable a server to identify remotely located malicious applications that mask their activities in hard to detect channel noise. As a result, servers are able to safeguard client data and transactions from some of the hardest to detect forms of malicious third party methods to acquire information and credentials. This allows service providers that use the example method, apparatus, and system disclosed herein to provide security assurances to customers and other users of their systems.

Throughout the disclosure, reference is made to malicious applications (e.g., malware), which can include any computer virus, counterfeit hardware component, unauthorized third party access, computer worm, Trojan horse, rootkit, spyware, adware, or any other malicious or unwanted software that interferes with communications between client devices and servers. Malicious applications can interfere with communications of a live session between a server and a client device by, for example, acquiring credentials from a client device or server, using a client device to instruct the server to move resources (e.g., money) to a location associated with the malicious application, injecting information into a form, injecting information into a webpage, capturing data displayed to a client, manipulating data flow between a client device and a server, or impersonating a client device using stolen credentials to acquire client device resources.

Additionally, throughout the disclosure, reference is made to client devices, which can include any cellphone, smartphone, personal digital assistant (“PDA”), mobile device, tablet computer, computer, laptop, server, processor, console, gaming system, multimedia receiver, or any other computing device. While this disclosure refers to connection between a single client device and a server, the example method, apparatus, and system disclosed herein can be applied to multiple client devices connected to one or more servers.

Examples in this disclosure describe client devices and servers performing banking transactions. However, the example method, apparatus, and system disclosed herein can be applied to any type of transaction or controlled usage of resources between a server and a client device including, but not limited to, online purchases of goods or services, point of sale purchases of goods or services (e.g., using Near Field Communication), medical applications (e.g., intravenous medication as dispensed by an infusion pump under the control of a computer at a nurses station or medication as delivered to a home address specified in a webpage), manufacturing processes (e.g., remote manufacturing monitoring and control), infrastructure components (e.g., monitoring and control of the flow of electricity, oil, or flow of information in data networks), transmission of information with a social network, or transmission of sensitive and confidential information.

The present system may be readily realized in a network communications system. A high level block diagram of an example network communications system 100 is illustrated in FIG. 1. The illustrated system 100 includes one or more client devices 102, one or more application servers 104, and one or more database servers 106 connected to one or more databases 108. Each of these devices may communicate with each other via a connection to one or more communication channels in a network 110. The network 110 can include, for example, the Internet or some other data network, including, but not limited to, any suitable wide area network or local area network. It should be appreciated that any of the devices described herein may be directly connected to each other and/or connected through the network 110. The network 110 may also support wireless communication with wireless client devices 102.

The client devices 102 access data, services, media content, and any other type of information located on the servers 104 and 106. The client devices 102 may include any type of operating system and perform any function capable of being performed by a processor. For instance, the client devices 102 may access, read, and/or write information corresponding to services hosted by the servers 104 and 106.

Typically, servers 104 and 106 process one or more of a plurality of files, programs, data structures, databases, and/or web pages in one or more memories for use by the client devices 102, and/or other servers 104 and 106. The application servers 104 may provide services accessible to the client devices 102 while the database servers 106 provide a framework for the client devices 102 to access data stored in the database 108. The servers 104 and 106 may be configured according to their particular operating system, applications, memory, hardware, etc., and may provide various options for managing the execution of the programs and applications, as well as various administrative tasks. A server 104, 106 may interact via one or more networks with one or more other servers 104 and 106, which may be operated independently.

The example servers 104 and 106 provide data and services to the client devices 102. The servers 104 and 106 may be managed by one or more service providers, which control the information and types of services offered. These service providers also determine qualifications as to which client devices 102 are authorized to access the servers 104 and 106. The servers 104 and 106 can provide, for example, banking services, online retail services, social media content, multimedia services, government services, educational services, etc.

Additionally, the servers 104 and 106 may provide control to processes within a facility, such as a process control system. In these instances, the servers 104 and 106 provide the client devices 102 access to read, write, or subscribe to data and information associated with specific processes. For example, the application servers 104 may provide information and control to the client devices 102 for an oil refinery or a manufacturing plant. In this example, a user of the client device 102 can access an application server 104 to view statuses of equipment within the plant or to set controls for the equipment within the plant.

While the servers 104 and 106 are shown as individual entities, each server 104 and 106 may be partitioned or distributed within a network. For instance, each server 104 and 106 may be implemented within a cloud computing network with different processes and data stored at different servers or processors. Additionally, multiple servers or processors located at different geographic locations may be grouped together as server 104 and 106. In this instance, network routers determine which client device 102 connects to which processor within the application server 104.

In the illustrated example of FIG. 1, each of the servers 104 and 106 includes a security processor 112. The security processor 112 monitors communications between the client devices 102 and the respective servers 104 and 106 for suspicious activity. The monitoring may include detecting errors in a communication channel between a client device 102 and a server 104 using hard and soft messages, as described herein.

In some embodiments, the security processor 112 may be configured to only detect channel errors that are of strategic importance. This is because malicious applications generally only target communications that convey high value information (e.g., banking information). As a result, using the security processor 112 for important communications helps reduce processing so that the security processor 112 does not validate communications that are relatively insignificant (e.g., browsing a webpage). These important communications can include authentication information, refinements to types of requested services, or details on desired allocation of resources under a client's control. These resources may be financial (e.g., on-line banking), material (e.g., flow control of raw material in manufacturing processes) or related to data management (e.g., encryption, decryption, addition to or removal from shared storage, copying, deletion, etc.).

In an example embodiment, a client device 102 requests to access data or services hosted by a server 104. In response, the server 104 determines hard information that corresponds to the request and identifies soft information compatible with the hard information. In some instances, the server 104 may use device characteristics or information of the client device 102 to select the soft messaging. Upon selecting the soft and hard messages, the security processor 112 selects how the messages are combined into transmission packets and instructs the server 104 to transmit the packets to the client device 102. To make the packets undecipherable by malicious applications, the security processor 112 may combine hard and soft information, rearrange the order of information transmission, or mix different layers of information.

The unperturbed location of any input boxes or buttons selected by the security processor 112 for soft messaging may vary, subtly, from session to session, without being observable by a client device 102 or a malicious application. For example, the absolute and relative positioning of page elements may be obscured by the incorporation of operating system and browser bugz and further obscured by seemingly routine use of byte code and JavaScript. The security processor 112 may also use redundant measures for determining rendered page geometry and activity so that information returned from the client device 102 may be further verified. For instance, benign “pop-up windows” featuring yes/no button messages such as: “would you have time to take our brief customer survey?” may be made to appear or not appear depending on actual cursor or mouse locations when a ‘submit’ button is pressed at the client device 102. Additionally, the security processor 112 may use generic geometrical and content related soft-variations (absolute and relative locations of input boxes and buttons, the appearance or lack of appearance of benign “pop-up” boxes, buttons, advertisements or images) to validate communications with a client device 102. In other words, the security processor 112 may use soft information provided by client devices 102 to also validate a communication channel.

After selecting which soft and hard information to send to the client device 102, the security processor 112 makes a prediction, in this example, as to a location of a ‘Submit’ icon on a fully rendered webpage displayed on client device 102. This icon is part of a banking website provided by application server 104. The security processor 112 may also use backscattered information received from routing components in the network 110 to form the prediction. This backscattered information provides, for example, how the soft and hard information in the transmitted message(s) are processed, routed, and rendered.

The security processor 112 then monitors a response by the client device 102 to identify coordinates of a mouse click of the ‘Submit’ icon. The security processor 112 determines that a malicious application is affecting communications if the prediction does not match the reported coordinates of the mouse click on the icon. In response to detecting a malicious application, the security processor 112 attempts to prevent the malicious application from further affecting communications with the affected client devices 102. In some embodiments, the security processor 112 instructs the servers 104 and 106 to alter normal operation and enter into a safe operations mode. In other embodiments, the security processor 112 restricts activities of the affected client devices 102 or requests the client devices 102 to re-authenticate or establish a more secure connection. The security processor 112 may also store a record of the incident for processing and analysis. In further embodiments, the security processor 112 may transmit an alert and/or an alarm to the affected client devices 102, personnel associated with the servers 104 and 106, and/or operators of the security processor 112.
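The comparison between the predicted icon location and the reported click coordinates may be illustrated by the following minimal sketch. It assumes, purely for illustration, that the prediction is a rectangular region in rendered pixel coordinates and that a small tolerance absorbs benign rendering differences. The names Rect, click_matches_prediction, and tolerance_px are hypothetical and are not defined by this disclosure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Predicted bounding box of the 'Submit' icon, in rendered pixel coordinates."""
    left: int
    top: int
    width: int
    height: int


def click_matches_prediction(predicted: Rect, click_x: int, click_y: int,
                             tolerance_px: int = 0) -> bool:
    """Return True when the reported click lies inside the predicted box.

    tolerance_px is a hypothetical allowance for benign rendering differences.
    """
    within_x = (predicted.left - tolerance_px
                <= click_x
                <= predicted.left + predicted.width + tolerance_px)
    within_y = (predicted.top - tolerance_px
                <= click_y
                <= predicted.top + predicted.height + tolerance_px)
    return within_x and within_y


# Illustrative values only: icon predicted at (400, 300), 120 px wide, 40 px tall.
predicted_submit = Rect(left=400, top=300, width=120, height=40)

# A click reported inside the predicted region is consistent with the prediction.
consistent = click_matches_prediction(predicted_submit, 455, 318)

# A click reported 50 px below the predicted region is treated as a mismatch.
suspicious = not click_matches_prediction(predicted_submit, 455, 390)
```

In such a sketch, a mismatch would be the point at which the security processor 112 could trigger one of the responses described above, such as a safe operations mode or a re-authentication request.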

While each server 104 and 106 is shown as including a security processor 112, in other embodiments the security processor 112 may be remotely located from the servers 104 and 106 (e.g., the security processor 112 may be cloud-based). In these embodiments, the security processor 112 is communicatively coupled to the servers 104 and 106 and remotely monitors for suspicious activity of malicious applications. For instance, the security processor 112 may provide soft information to the servers 104 and 106. The security processor 112 may also receive client device response messages from the servers 104 and 106. In instances when the security processor 112 detects a malicious application, the security processor 112 remotely instructs the servers 104 and 106 how to remedy the situation.

A detailed block diagram of electrical systems of an example computing device (e.g., a client device 102, an application server 104, or a database server 106) is illustrated in FIG. 2. In this example, the computing device 102, 104, 106 includes a main unit 202 which preferably includes one or more processors 204 communicatively coupled by an address/data bus 206 to one or more memory devices 208, other computer circuitry 210, and one or more interface circuits 212. The processor 204 may be any suitable processor, such as a microprocessor from the INTEL PENTIUM® or CORE™ family of microprocessors. The memory 208 preferably includes volatile memory and non-volatile memory. Preferably, the memory 208 stores a software program that interacts with the other devices in the system 100, as described below. This program may be executed by the processor 204 in any suitable manner. In an example embodiment, memory 208 may be part of a “cloud” such that cloud computing may be utilized by computing devices 102, 104, 106. The memory 208 may also store digital data indicative of documents, files, programs, web pages, etc. retrieved from computing device 102, 104, 106 and/or loaded via an input device 214.

The example memory devices 208 store software instructions 223, webpages 224, user interface features, permissions, protocols, configurations, and/or preference information 226. The memory devices 208 also may store network or system interface features, permissions, protocols, configuration, and/or preference information 228 for use by the computing devices 102, 104, 106. It will be appreciated that many other data fields and records may be stored in the memory device 208 to facilitate implementation of the methods and apparatus disclosed herein. In addition, it will be appreciated that any type of suitable data structure (e.g., a flat file data structure, a relational database, a tree data structure, etc.) may be used to facilitate implementation of the methods and apparatus disclosed herein.

The interface circuit 212 may be implemented using any suitable interface standard, such as an Ethernet interface and/or a Universal Serial Bus (USB) interface. One or more input devices 214 may be connected to the interface circuit 212 for entering data and commands into the main unit 202. For example, the input device 214 may be a keyboard, mouse, touch screen, track pad, track ball, isopoint, image sensor, character recognition, barcode scanner, microphone, and/or a speech or voice recognition system.

One or more displays, printers, speakers, and/or other output devices 216 may also be connected to the main unit 202 via the interface circuit 212. The display may be a cathode ray tube (CRT), a liquid crystal display (LCD), or any other type of display. The display presents visual output generated during operation of the computing device 102, 104, 106. For example, the display may provide a user interface and may display one or more webpages received from a computing device 102, 104, 106. A user interface may include prompts for human input from a user of a client device 102 including links, buttons, tabs, checkboxes, thumbnails, text fields, drop down boxes, etc., and may provide various outputs in response to the user inputs, such as text, still images, videos, audio, and animations.

One or more storage devices 218 may also be connected to the main unit 202 via the interface circuit 212. For example, a hard drive, CD drive, DVD drive, and/or other storage devices may be connected to the main unit 202. The storage devices 218 may store any type of data, such as pricing data, transaction data, operations data, inventory data, commission data, manufacturing data, marketing data, distribution data, consumer data, mapping data, image data, video data, audio data, tagging data, historical access or usage data, statistical data, security data, etc., which may be used by the computing device 102, 104, 106.

The computing device 102, 104, 106 may also exchange data with other network devices 220 via a connection to the network 110 or a wireless transceiver 222 connected to the network 110. Network devices 220 may include one or more servers (e.g., the application servers 104 or the database servers 106), which may be used to store certain types of data, and particularly large volumes of data which may be stored in one or more data repositories. A server may include any kind of data including databases, programs, files, libraries, pricing data, transaction data, operations data, inventory data, commission data, manufacturing data, marketing data, distribution data, consumer data, mapping data, configuration data, index or tagging data, historical access or usage data, statistical data, security data, etc. A server may store and operate various applications relating to receiving, transmitting, processing, and storing the large volumes of data. It should be appreciated that various configurations of one or more servers may be used to support and maintain the system 100. For example, servers may be operated by various different entities, including sellers, retailers, manufacturers, distributors, service providers, marketers, information services, etc. Also, certain data that is stored on a server may also be stored in a client device 102, either temporarily or permanently, for example in memory 208 or storage device 218. The network connection may be any type of network connection, such as an Ethernet connection, digital subscriber line (DSL), telephone line, coaxial cable, wireless connection, etc.

Access to a computing device 102, 104, 106 can be controlled by appropriate security software or security measures. An individual user's access can be defined by the computing device 102, 104, 106 and limited to certain data and/or actions. Accordingly, users of the system 100 may be required to register with one or more computing devices 102, 104, 106.

The Client-Server Communication Channel

FIG. 3 shows a diagram of a communication session 300 between a client device 102 and an application server 104. The communication session 300 occurs over a communication channel 302, which is included in the network 110 of FIG. 1. The communication channel 302 includes hardware and software components that convey, relay, shape and forward information between the server 104 and the client device 102. The hardware components include network node devices such as routers, mobile switching center components, base switching center components, data storage, caches, device proxies and firewalls. The hardware components can also include client device specific endpoints, computer architecture, processor types, mobile device chipsets, SIM cards and memory. The software components of the channel include network or endpoint device platforms, instruction sets, operating systems, operating system versions, application programming interfaces (“api”), and libraries. The software components can also include client device endpoint software, user interfaces, browser types, browser versions, cascading style sheets, scripts, document object models, Javacode, byte script, etc.

In the communication channel 302, information transmitted by the server 104 (e.g., soft/hard information included within soft/hard messages) is acted upon, processed, forwarded, and rendered by the various intervening hardware and software channel components. The processing is performed by hardware and software components residing on both network and client device endpoints. The client device 102 is the ultimate recipient of the fully realized, completely processed version of the information transmitted by the server 104. The client device 102 is stimulated by the received (processed) information into prompting a user for decision(s) and/or performing one or more actions. Once a user inputs a decision, the client device 102 communicates a response message to the server 104 through the channel 302.

While FIG. 3 shows one communication channel 302, other communication channels can include different components and corresponding behavioral characteristics that vary from one server-client device connection to another. The behavioral characteristics identify ways in which information is acted upon, processed, forwarded and rendered by the hardware and software components of the channel 302. The security processor 112 uses these behavioral characteristics to help form a prediction of a response from the client device 102.

Once a server-client device connection is established across a channel 302 and the primary, intended function of that communication is initiated (e.g., the type of transaction that is to occur across the channel 302), secondary characteristics and observables are generated in the channel 302 as a consequence. There are two types of secondary characteristics and observables: “global” (involving many or all channel components) and “local” (involving a single, pair, or triple of channel components).

The “global” channel's temporal secondary characteristics are applied across many or all hardware/software components and layers in, for example, the network 110 and include: i) number and size of discrete transmissions, ii) density of discrete transmissions, iii) frequency and other spectral content (e.g., content obtained by discrete Fourier transform, wavelet transform, etc. of an observed time series), and iv) geo-spatial density. These characteristics are derived from observables (e.g., from observation of information flow between client device 102 and server 104) that include, for example, i) delivery times, ii) delivery rates, iii) transmission requests (as reports on errors or inefficiencies), and iv) sequencing or permutations in an order of information processing events. These observables are dependent on a number of factors including, for example, hardware type, software type, and current state (e.g., traffic load, internal queue lengths, etc.) of components that comprise the channel 302.
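As a hedged illustration of how spectral content might be obtained from an observed time series, the following sketch computes a discrete Fourier transform of the gaps between delivery times. The timestamps are invented for illustration, the direct O(n²) transform is used only to keep the example self-contained, and the function names are hypothetical rather than part of this disclosure.

```python
import cmath
import math


def inter_arrival_times(timestamps):
    """Convert delivery timestamps (seconds) into gaps between deliveries."""
    return [later - earlier for earlier, later in zip(timestamps, timestamps[1:])]


def dft_magnitudes(samples):
    """Return the magnitude of each discrete Fourier transform bin.

    A direct O(n^2) evaluation of the DFT definition; adequate for short series.
    """
    n = len(samples)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(samples)))
        for k in range(n)
    ]


# Hypothetical delivery times whose gaps alternate between about 0.1 s and 0.3 s.
timestamps = [0.0, 0.1, 0.4, 0.5, 0.8, 0.9, 1.2, 1.3, 1.6]
gaps = inter_arrival_times(timestamps)  # eight gaps
spectrum = dft_magnitudes(gaps)

# Bin 0 carries the sum of the gaps; the alternating pattern shows up in bin 4,
# the highest-frequency bin of an eight-sample series.
dominant_bin = max(range(1, len(spectrum)), key=lambda k: spectrum[k])
```

A change in the dominant bin or its magnitude between otherwise similar sessions is one example of an observable that could be compared against an expected channel characteristic.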

“Local” observables may also be generated on a per client device basis or per layer basis in the channel 302 of FIG. 3 by server 104 and/or client device 102 initiated stimuli. The variations between client devices or layers are a result of a client's or layer's internal, device specific, information processing prioritization rulesets/protocols, inter-component signaling rulesets, and/or protocols that use hardware or software-based signaling components. The local observables may indicate, for example, a browser type used by the client device 102, an operating system of the client device 102, or device coding schemes used by the client device 102.

In the example embodiment of FIG. 3, the security processor 112 structures the hard and soft messaging output by the server 104 so that the secondary characteristics and observables function as a secondary means of communication between the client device 102 and server 104. At the same time, the security processor 112 structures the hard and soft messaging output by the server 104 in a manner consistent with the original purpose of the connection with the client device 102. As a result, the secondary means of communication between the server 104 and the client device 102 over the channel 302 is configured to not interfere with the primary, intended function of the server-client device interaction. The security processor 112 accordingly uses the channel 302 to vary soft information without changing the nature of the intended transaction between the server 104 and the client device 102.

In FIG. 3, the channel 302 is constructed for universal use (e.g., an open architecture channel). That is, the component and collections of component technologies of the channel 302 are designed to enable a rich variety of server types, client device types, and purposeful communications between the different server and client device types. This enables the security processor 112 to use a variety of different soft messaging methods to achieve the original, intended purpose of the server-client transaction. However, each soft messaging method sets into motion a different set of (global and local) channel characteristic signals and observables. The security processor 112 is accordingly able to establish a secondary communication language between the server 104 and the client device 102 across the channel 302 using the association between variations in soft messaging methods (global, local) and corresponding channel characteristic responses.

The communication session 300 of FIG. 3 also includes malicious applications 304, which are configured to interfere with client-server communications while allowing the primary, intended function of the server-client device interaction to occur. However in accomplishing and creating this perturbation of the primary, intended communication between the server 104 and the client device 102, the malicious applications 304 effectively become an “additional component” of the channel 302, thereby unknowingly affecting the secondary communications. As shown in FIG. 3, the malicious applications 304 can insert information into the channel 302 and/or extract information from the channel 302 using engineered channel noise. The example security processor 112 detects these malicious applications 304 by monitoring how generated malicious channel noise impacts the consistently crafted client-server secondary communications.

FIG. 4 shows a diagram of backscattered channel information 402 during the communication session 300 between the server 104 and the client device 102 using the communication channel 302 of FIG. 3. From the point of view of the server 104 (or a trusted proxy), a complete communication with the client device 102 includes two distinct segments: information sent to the client device 102 and information received from the client device 102 in response to the information sent. Information 402 regarding the progress of channel components in processing, realizing, and rendering information, and regarding inter-device signaling events, scatters back to the server 104. If the server 104, via the security processor 112, subtly varies the content that it sends to the client device 102 through soft messaging, the effects of the changes will be detectable in the echoed information returning to the server 104 from the various components and processing layers of the channel 302.

The a priori knowledge of the information transmitted by the server 104 (the information and stimuli actually sent into the channel 302 to the client device 102), together with the global and local backscatter information 402 from the components and layers of the channel 302, permits the server 104 (or a trusted proxy) to form a prediction as to the condition of the final, post-processing, fully rendered version of the information displayed by the client device 102. Additionally, direct, client device initiated, response messages to the server 104 (e.g., mouse clicks or user supplied account information) constitute a means for the security processor 112 to determine a prediction as to the fully rendered version of the information displayed by the client device 102. The information in the response from the client device 102 can be entered by a user using a mouse, keyboard, touchscreen, an infrared ID tag scanner, etc. For example, information of a returned mouse click informs the security processor 112 that a selectable box was 1) rendered, 2) selected, and 3) clicked at (x,y) pixel coordinates.

The security processor 112 determines discrepancies between the prediction and the direct, client device 102 initiated responses of the fully rendered information to detect and identify errors (e.g., malicious applications 304) in the channel 302. The detection and identification of channel error causes the security processor 112 to alter normal operations of the server 104. In some embodiments, the security processor 112 may cause the server 104 to enter a safe operations mode, restrict authorized client device activities, and/or generate an alert and/or alarm.

The Use of Soft Messaging For Channel Verification

As discussed above, the security processor 112 can use different types and variations of soft messaging and information to help identify malicious applications. This variation helps prevent malicious applications from reverse engineering the soft messaging and circumventing the approaches described herein. As described below, the variation can include changes to font size, changes to web page arrangement of hard information and graphics, addition of characters to user inputs, changes to function definitions, requests for user prompts through banners and pop-up windows, or implementations of bugz. The variation can also include changing an order in which hard and soft information is sent from a server 104 or a client device 102.

The order in which information arrives at a server 104 or client device 102 is not relevant for business purposes. The inclusion of additional information, for example the pixel location of a mouse click, cursor, or scroll bar (e.g., soft information) in addition to account information (e.g., hard information) does not affect the business purpose. The method of encoding information and, within reasonable bounds, the amount of time information spends in transmission through the channel 302 have a generally neutral impact on business purposes. “Soft” choices consistent with the “hard” business purpose exist at many layers of the channel 302 ranging from the choice(s) of physical method(s) used, transmission encoding method(s) used on the physical layer(s), to aesthetic details of information presentation and user interactions with a presented webpage. The choice of soft messaging by the server 104 (or its trusted proxy) corresponding to given hard information is a many-to-one mapping. In a similar way, the local, specialized function and contribution of each network and client device specific hardware and software channel component is decomposable into hard and soft elements consistent with achieving the overall, global intent of the interaction of the server 104 with the client device 102.

The security processor 112 accordingly maintains hard functionality of the server-client device connection (e.g., the session 300) while varying the soft information. Soft information variations are recorded a priori by the security processor 112 or the server 104 (or its trusted proxy) in a data structure to create a large set of composite (hard and soft) messages to be transmitted together. In other embodiments, the server 104 may transmit the hard messages separate from the soft messages. The soft variations are constrained by the fact the final presentation at the client device 102 must be intelligible, not garbled. Further, the soft variations must be of sufficient complexity that the malicious applications 304 are faced with a time consuming reverse engineering problem in deciphering the accumulated impact of the soft message changes throughout the channel 302.
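The many-to-one relationship between soft variations and a single item of hard information, and the a priori recording of composite messages in a data structure, may be sketched as follows. The particular soft degrees of freedom (font size and button offset), their values, and all function names are assumptions chosen for illustration and are not prescribed by this disclosure.

```python
import itertools
import random

# Hypothetical soft degrees of freedom; none of these values come from the disclosure.
FONT_SIZES_PX = (13, 14, 15)
BUTTON_OFFSETS_PX = ((0, 0), (3, 0), (0, 2), (3, 2))


def build_composite_messages(hard_payload):
    """Record, ahead of time, every pairing of one hard payload with a soft variation.

    Many soft variations map to the same hard payload (a many-to-one mapping).
    """
    table = {}
    combos = itertools.product(FONT_SIZES_PX, BUTTON_OFFSETS_PX)
    for variant_id, (font_size, offset) in enumerate(combos):
        table[variant_id] = {
            "hard": hard_payload,
            "font_size_px": font_size,
            "button_offset_px": offset,
        }
    return table


def select_for_session(table, rng):
    """Pick one recorded composite message for a session."""
    variant_id = rng.choice(sorted(table))
    return variant_id, table[variant_id]


hard_payload = {"form": "login", "fields": ("username", "password")}
messages = build_composite_messages(hard_payload)

# A seeded generator keeps the sketch repeatable; a deployment would presumably
# draw from an unpredictable source.
variant_id, chosen = select_for_session(messages, random.Random(7))
```

Because each recorded variant carries the same hard payload, the business content is unchanged regardless of which variant is selected, while the recorded soft parameters give the server a basis for predicting the rendered result.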

The Use of Bugz in Soft Messaging For Channel Verification

As mentioned above, the security processor 112 may use implementations of bugz in soft information variation. Bugz are anomalous, device, software, protocol and/or physical communication medium specific interpretations of input instructions that produce consistent although unexpected output. Bugz are inherent in many components of the channel 302 and are generally undetectable by malicious applications 304 without significant processing and analysis. The use of bugz helps enhance the complexity of soft messaging by enabling the security processor 112 to craft soft information so that the soft degrees of freedom within and between hardware and software based components of the channel 302 are combined in a multiplicative fashion. While four examples of bugz are described below, the security processor 112 can implement any type of bugz in soft messaging.

One type of bugz is based on different operating systems of client devices 102 processing the same incoming packet streams differently. As a result of this bugz, the security processor 112 can create soft messaging packet streams intended to induce certain known behaviors in an operating system to display hard information. Another type of bugz is based on different operating systems of client devices 102 interpreting the same portion of Extensible Markup Language (“xml”) code differently. Prior to initializing its service to a client device 102, a server 104 or security processor 112 selects from a variety of ways that a portion of xml code may be written and selects from a variety of ways to order, time delay, and geographically position the way the packets containing that code are transmitted into the channel 302.

Yet another type of bugz is based on HyperText Markup Language (“html”) code and cascading style sheet instructions that can be written and combined in contrasting and confusing fashion by a server 104 or the security processor 112. The server 104 can also use different layers of the style sheet in opposition of each other. For example, the security processor 112 could instruct a server 104 to randomize which portions of a webpage are sent in style sheet instructions at sequential times. As a result, a malicious application 304 is unable to easily determine which style sheet instruction corresponds to which portion of the webpage.
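One possible reading of the style sheet randomization described above is sketched below: the server shuffles the order in which style sheet rules are transmitted and records the permutation so that it alone can relate each transmitted rule to its original position. The rules, the seed, and the function name are hypothetical, and the sketch assumes each rule targets a distinct element so that transmission order does not change the rendered page.

```python
import random


def shuffled_rule_order(rules, rng):
    """Return (order, sent), where sent[i] is rules[order[i]].

    The server keeps `order`; an observer of the channel sees only `sent`.
    """
    order = list(range(len(rules)))
    rng.shuffle(order)
    return order, [rules[index] for index in order]


# Illustrative rules only; each targets a different element.
rules = [
    "#username { margin-left: 12px; }",
    "#password { margin-left: 12px; }",
    "#submit { margin-top: 8px; }",
    "#banner { height: 60px; }",
]

order, sent = shuffled_rule_order(rules, random.Random(2015))

# With the recorded permutation, the server can map each transmitted rule
# back to its original position.
restored = [None] * len(rules)
for position, original_index in enumerate(order):
    restored[original_index] = sent[position]
```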

A further type of bugz is based on code libraries that are internally re-arranged by the security processor 112 so that functions that use the code libraries on client devices 102 are contrasted with expected performance in accord with the usage conventions of the standard library. For example, the security processor 112 can use this type of bugz to swap the definitions of the “add” and “multiply” functions. As a result of this swap, the client device 102 performs the intended function while a malicious application 304 incorrectly determines that a different function is being performed. As a result, the security processor 112 can determine if a malicious application 304 attempts to change a result of the function or transaction.
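The effect of swapping the “add” and “multiply” definitions can be illustrated with the following minimal sketch. The dictionary-based library, the total_price function, and the numeric values are assumptions made for illustration; the disclosure does not specify how the re-arranged library is represented.

```python
import operator

# Conventional meaning of the two names, as an observer would assume.
STANDARD_LIBRARY = {"add": operator.add, "multiply": operator.mul}


def rearranged_library(library):
    """Return a copy of the library with the "add" and "multiply" definitions swapped."""
    swapped = dict(library)
    swapped["add"], swapped["multiply"] = library["multiply"], library["add"]
    return swapped


def total_price(library, unit_price, quantity):
    """Page code written for the re-arranged library: it calls "add" to multiply."""
    return library["add"](unit_price, quantity)


session_library = rearranged_library(STANDARD_LIBRARY)

# The client, using the re-arranged library, computes the intended product.
intended = total_price(session_library, 25, 4)   # 25 * 4 = 100

# An observer assuming standard conventions reads the same call as a sum.
misread = total_price(STANDARD_LIBRARY, 25, 4)   # 25 + 4 = 29
```

A returned value consistent with the misread interpretation rather than the intended one would be one indication that something other than the unperturbed client code produced the result.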

Often the ultimate resolution of the purposefully mis-engineered “spaghetti” code applied by the security processor 112 in soft messaging depends on a browser type and version at the client device 102. JavaScript and bytecode, for example, may be similarly obfuscated by the security processor 112 without negatively detracting from run time performance or the ability of the server 104 and client device 102 to conduct business. These effects of the examples described above may be enhanced by incorporating operating system and browser bugz into the instructions. The result of this incorporation is a soft formatting and presentation style at a client device endpoint that makes it difficult for malicious applications 304 to predict and/or automatically interpret the soft information. This makes the soft information difficult for the malicious applications 304 to alter, replace, or counterfeit in real time. Although this encoding is difficult to interpret in real time, it may be easily tested experimentally, a priori by a server 104 (or its trusted proxy). It is this a priori knowledge of the unperturbed and fully implemented rendering of the instruction set at the client device 102 that forms the basis of the prediction determination made by the security processor 112 of the formatting at the client device endpoint. The example security processor 112 creates the variation among the soft messages to increase the differences between the prediction and direct versions of the fully rendered information displayed by the client device 102.

A Comparative Example of Channel Verification With and Without the Security Processor

FIGS. 5 and 6 show diagrams representative of a malicious application 304 affecting the communication session 300 between the client device 102 and server 104. In particular, FIG. 5 shows the effects of the malicious application 304 when the server 104 does not include a security processor 112 and FIG. 6 shows the effects of the malicious application 304 when the server 104 includes the security processor 112. It should be noted that FIGS. 5 and 6 are only one example of communications between a client device 102 and server 104. Other examples can include additional effects by malicious applications 304 and/or different types of transactions performed between the server 104 and client device 102.

In FIG. 5, the server 104 intends to communicate a deliberate, pre-determined datagram 402 to client device 102. Here, the datagram 402 is a webpage that prompts a user to provide a username and password. The pre-determined datagram 402 is represented as a binary form for transmission purposes, shown in FIG. 5 as the number “0” in data transmission 404. The final, fully rendered, intended, client device intelligible and/or useable form of the data transmission 404 is known to the server 104 (or its proxy) at and/or before the time of the data transmission to the client device 102.

The pre-determined, intended data transmission 404 progresses through and/or is processed by the various hardware and/or software based components, layers, and protocols of channel 302. The sequence of “0's” represents the original intent of the server 104 and is represented in FIG. 5 as a sequence of “0's” progressing through a sequence of rectangles in the direction of the dashed, horizontal arrows 406. The upper arrow represents the sequence of processing events experienced by the “hard” portion of the data and the lower arrow represents the sequence of processing events experienced by the “soft” data. The soft and hard data transmission paths may or may not be the same and may or may not entail identical processing events.

As transmitted data 404 progresses through and/or is processed by the channel 302 with the original intent of the server 104 intact, secondary information 408 generated by the routing and processing of the data 404 is scattered back through the channel 302 to the server 104. The secondary information 408 can include, for example, an operating system of the client device 102, a browser type used by the client device 102, a cascading style sheet type used to display the soft/hard information, JavaScript information, byte code data, etc. In other instances, the secondary information 408 may be reported by the client device 102 as device information after initiating the communication session 300 with the server 104. The secondary information 408 is generated, for example, from Transmission Control Protocol/Internet Protocol (“TCP/IP”) negotiation, Hypertext Transfer Protocol (“HTTP”) requests and confirmations, and/or rendering information. In other examples, the secondary information 408 can be generated through other channel 302 backscattering routing and/or processing.

During transmission of the data 404 to the client device 102, the malicious application 304 creates channel noise 410, which alters the data 404. The channel noise 410 causes an intelligent modification of the data 404 to be realized at the client device 102 instead of the original pre-determined datagram 402. This alteration is represented in FIG. 5 as the number “1” and may incorporate hard and/or soft information.

The client device 102 receives the final, fully rendered, client device intelligible form of the data as altered by the malicious application 304 and displays this data as datagram 412. Here, the channel noise 410 adds a security question to the webpage and moves the location of a ‘submit’ button to accommodate the security question. As a result of this channel noise 410, the server 104 believes the client device 102 is viewing datagram 402 when in fact the client device 102 is viewing altered datagram 412. Further, a user of the client device 102 has no reason to be suspicious of the datagram 412 because the maliciously inserted security question appears to coincide with the remainder of the datagram 412.

When the client device 102 returns a response message to the server 104, the malicious application 304 detects the response and uses channel noise 410 to remove the answer to the security question. This is represented by transition of the data 404 from “1” to “0” before the data reaches the server 104. As a result, the server 104 receives a response from the client device 102 that only includes the username and password. The server 104 never received an indication that the client device 102 provided a response to a security question, and, accordingly, never detects the presence of the malicious application 304. The malicious application 304 remains hidden to carry out further stealthy compromises of account security.

FIG. 6 shows how security processor 112 can validate communications between the server 104 and the client device 102 during the same communication session 300 described in conjunction with FIG. 5. Similar to FIG. 5, the server 104 in FIG. 6 is to transmit a request for a username and password to access an account. However, unlike in FIG. 5, the security processor 112 in FIG. 6 specifically creates the soft content of a deliberate, pre-determined datagram 502 before transmission to the client device 102.

FIG. 6 shows soft information 504 and hard information 506 transmitted by the server 104. The security processor 112 varies the soft data 504 from one client-server connection to the next to prevent the client device 102 or the malicious application 304 from knowing the components of the soft information 504 beforehand. The soft information 504 is however fully understood by the server 104 (or its trusted proxy) by the time of transmission to the client device 102. The server 104 stores the soft information 504 to a data structure to help form a prediction as to a response from the client device 102. The server 104 initiated soft and hard information 504, 506 is shown as “0's” in the blocks.

During the communication session 300, the propagation of the soft and hard information 504, 506 through channel 302 causes secondary information 508 to be generated. The secondary information 508 is scattered back to the server 104 and the security processor 112. The security processor 112 uses the secondary information 508 in conjunction with the soft information 504 to form a datagram 510 of the prediction. FIG. 7 shows an enlarged image of the datagram 510 including the request for the username and password. The server 104 uses the datagram 510 to predict how the client device 102 will process, render, and display datagram 502. In other embodiments, the security processor 112 stores the secondary information 508 in conjunction with the soft information 504 in a data structure rather than rendering datagram 510.

Similar to FIG. 5, the malicious application 304 uses channel noise 512 to alter the soft and/or hard data 504, 506, which is shown in FIG. 6 as the number “1.” As before, the alteration includes the addition of a security question and the movement of the ‘submit’ button. The client device 102 then receives, processes, renders, and displays the altered data.

A rendered datagram 514, as displayed by the client device 102, is displayed in FIG. 8. This datagram 514 shows a security question prompt below the prompts for the username and the password. In addition, the ‘submit’ button and corresponding text have been lowered in the datagram 514 to make room for the security question. As a result, the security question appears to be genuine to a user of the client device 102.

After displaying the datagram 514, the client device 102 transmits a response, which also includes hard and soft information. Similar to FIG. 5, the malicious application 304 uses channel noise 512 to remove the response to the security question, which is shown in FIG. 6 in the transition of the hard information 506 from “1” to “0.” However, while the malicious application 304 removed the hard information 506, the malicious application 304 is not concerned with the mouse click information, and accordingly does not alter the soft information 504.

In FIG. 6, the server 104 and the security processor 112 receive the response from the client device 102, including the soft and hard information 504, 506. The security processor 112 compares the soft information 504 to the prediction and is able to determine that the communication session 300 has been compromised. In other words, the security processor 112 detects the malicious application 304 by determining that the coordinates of the mouse click on the ‘submit’ button do not match the coordinates of the ‘submit’ button determined in the prediction.

FIG. 9 shows a diagram of a comparison datagram 516 representative of the comparison made by the security processor 112 to determine if a malicious application is affecting communications between the server 104 and the client device 102. The comparison datagram 516 includes the prediction datagram 510 formed by the security processor 112 and a construction of the datagram 514 based on the soft and hard information received from the client device 102. For visual effect, the prediction datagram 510 is superimposed upon the datagram 514 from the client device 102.

As shown in FIG. 9, the geometry of the datagram 514 is altered, in particular the position of the ‘submit’ button as a result of the space needed to reformat the page and accommodate the additional bogus security question. In addition to the location of the data fields, the datagram 514 includes soft information 504 such as a position of a mouse click associated with the ‘submit’ button. In this example, the server 104 requests that the client device 102 report the mouse click as soft information, for example, by relying on a “hidden in the clear” communication protocol. In some examples, the server 104 or security processor 112 may embed the authentication form in a “trendy” image so that the relative coordinates of the mouse clicks are returned as a matter of routine and not detected by the malicious application.

In this authentication page example, by comparing the prediction position of the ‘submit’ button with the directly reported position, the security processor 112 detects whether an error has occurred during communication session 300. Here, the security processor 112 detects that the datagram 514 does not align with the datagram 510, and accordingly determines that the malicious application 304 is affecting communications.

In some embodiments, the security processor 112 may determine an allowable deviation or threshold for datagram 510. Thus, as long as, for example, the ‘submit’ button is located within the allowable deviation, the security processor 112 determines that communications are not being affected by malicious applications. The security processor 112 may determine what an allowable deviation is for the datagram 510 based on, for example, secondary information 508, characteristics of the client device 102, or history information of how the datagram 510 has been displayed by other client devices.
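The allowable-deviation comparison described above can be illustrated with a minimal Python sketch. The names Point and within_allowable_deviation, the use of Euclidean distance in pixels, and the coordinate values are assumptions made for illustration only; the disclosure does not specify how the deviation is measured.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Point:
    """A screen position in pixels (hypothetical representation)."""
    x: float
    y: float


def within_allowable_deviation(predicted: Point, reported: Point,
                               max_deviation: float) -> bool:
    """Return True when the reported position lies within max_deviation
    (Euclidean distance, an assumed metric) of the predicted position."""
    dx = reported.x - predicted.x
    dy = reported.y - predicted.y
    return (dx * dx + dy * dy) ** 0.5 <= max_deviation


# Hypothetical values: the prediction places the 'submit' button at (200, 340).
predicted_submit = Point(200, 340)
# A click a few pixels away is treated as matching the prediction.
small_offset_ok = within_allowable_deviation(predicted_submit, Point(203, 338), 10)
# A button pushed down to make room for an inserted field falls outside it.
shifted_ok = within_allowable_deviation(predicted_submit, Point(200, 420), 10)
```

In this sketch a larger max_deviation would tolerate more rendering variation among legitimate client devices at the cost of missing smaller layout changes, which is the trade-off the threshold is meant to tune.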

Examples of Channel Verification Using Different Types of Soft Messaging

As disclosed, the security processor 112 uses different types and variations of soft information and soft messaging to validate communication channels between servers 104 and client devices 102. The types of soft information and messaging can include changes to font size, changes to web page arrangement of hard information and graphics, addition of characters to user inputs, changes to function definitions, requests for user prompts through banners and pop-up windows, or implementations of bugz. The following sections describe how the security processor 112 uses different types of soft information and messaging.

Soft Messaging Using Text Size and Font Variations

FIG. 10 shows a diagram of a datagram 1000 that includes a code section 1002 and a result section 1004. The datagram 1000 illustrates how soft information can be selected or created by the security processor 112 in code section 1002. The datagram 1000 also shows how the soft information would be displayed on a client device 102 in the result section 1004.

FIG. 10 shows that character sets, font types and point sizes may be varied by the security processor 112 from session to session. These variations are in addition to the geometrical and content related soft-variations described in the previous comparative example. In the code section 1002, keyboard and mouse functionality may be made functions of a number of characters typed or x,y coordinates of text boxes. These modifications may be subtle and may also be made session dependent. The security processor 112 may invoke changes using any seemingly contrasting combination of coding instructions via HTML, XML, CSS, JavaScript, byte code, etc. The security processor 112 may also invoke changes by altering or restricting elements available for coding instructions to draw from, for example, available character sets.

For example, in the datagram 1000, the security processor 112 is subject to a ruleset based on the hard information that is required to be transmitted (e.g., the prompt for a username and password). Here, the security processor 112 selects a soft information or message variation such that, in the fully processed and rendered information presented at the client device 102, the username transaction field is rendered in a font size of 12, the first password field is rendered in a font size of 13, and the second password field is rendered in a font size of 14. In other examples, the security processor 112 may also vary a font type, font color, font weight, or any other text variation allowable for the corresponding hard information.

The variation among the font sizes is used by the security processor 112 to form a prediction. For instance, the username provided by the client device 102 is to be in 12 point font while the first password is to be in 13 point font. If a malicious application uses channel noise to alter the username or password responses or add a second transaction, the security processor 112 is able to detect the modification by the malicious application if the returned font size does not match the prediction. If the malicious application is more sophisticated and processes the soft information returned from the client device 102 to determine the font size, the extra time spent processing the information provides an indication to the security processor 112 that a malicious application is affecting communications. As a result, the soft messaging makes it relatively difficult for a malicious application to go undetected by the security processor 112.
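As a minimal sketch of the font-size comparison, the Python fragment below checks a reported set of font sizes against a per-session prediction. The field names, the dictionary layout, and the function name mismatched_fields are hypothetical; only the 12, 13, and 14 point values come from the example above.

```python
# Prediction for one session: field name -> font size in points.
# The sizes follow the example above; the field names are assumed.
predicted_fonts = {"username": 12, "password_1": 13, "password_2": 14}


def mismatched_fields(predicted, reported):
    """Return the sorted names of fields whose reported font size differs
    from the prediction, plus any reported field absent from the prediction."""
    mismatches = {name for name, size in predicted.items()
                  if reported.get(name) != size}
    mismatches |= set(reported) - set(predicted)
    return sorted(mismatches)


# A response that matches the prediction yields no mismatches.
clean = mismatched_fields(
    predicted_fonts, {"username": 12, "password_1": 13, "password_2": 14})

# A response carrying an injected field is flagged by that field's name.
tampered = mismatched_fields(
    predicted_fonts,
    {"username": 12, "password_1": 13, "password_2": 14, "security_answer": 12})
```

An empty result would indicate that the returned soft information agrees with the prediction; any non-empty result could be treated as an indication that the response was altered.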

Soft Messaging Using Programmed Keystrokes

In another embodiment, the code section 1002 may include code that instructs a client device 102 to programmatically generate keystrokes based on keystrokes provided by a user. The security processor 112 uses the algorithm for the programmatically generated keystrokes to form a prediction. The security processor 112 transmits the algorithm for the programmatically generated keystrokes through xml code, Java code, etc. The security processor 112 may also use the programmatically generated keystrokes in Document Object Models (“DOMs”) of hidden form fields.

Upon receiving the code, the client device 102 applies the algorithm to the specified data fields. For example, one algorithm may specify that the letter ‘e’ is to be applied after a user types the letter ‘b’ and the number ‘4’ is applied after a user types the number ‘1.’ When the user submits the entered text, the client device 102 transmits the user provided text combined with the programmatically generated keystrokes in a response message. For instance, in the result section 1004 of FIG. 10, the client device 102 may add keystrokes to the user provided username or password.

A malicious application that uses channel noise may attempt to alter text, inject text, or add data fields to the response from the client device 102. However, the security processor 112 is able to identify which text was affected by the malicious application based upon which of the received text does not match the algorithm-based keystroke prediction. As a result, the security processor 112 is able to detect the malicious application.
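The keystroke rule given above (the letter ‘e’ applied after ‘b’ and the number ‘4’ applied after ‘1’) can be sketched in Python as follows. The function names and the choice to return None for a response that violates the rule are assumptions for illustration; the disclosure does not prescribe a particular encoding.

```python
# Session rule from the example: after 'b' add 'e'; after '1' add '4'.
RULES = {"b": "e", "1": "4"}


def apply_keystrokes(user_text, rules=RULES):
    """Client side: append the programmed keystroke after each trigger key."""
    out = []
    for ch in user_text:
        out.append(ch)
        if ch in rules:
            out.append(rules[ch])
    return "".join(out)


def recover_user_text(received, rules=RULES):
    """Server side: strip the programmed keystrokes and return the user's
    text, or None when the received text does not follow the rule."""
    out = []
    i = 0
    while i < len(received):
        ch = received[i]
        out.append(ch)
        i += 1
        if ch in rules:
            # The very next character must be the programmed keystroke.
            if i >= len(received) or received[i] != rules[ch]:
                return None
            i += 1
    return "".join(out)


encoded = apply_keystrokes("bob1")      # text as sent by the client
decoded = recover_user_text(encoded)    # legitimate response round-trips
injected = recover_user_text("bob1")    # text injected without the rule
```

Text injected by an application that does not know the session rule fails the check wherever a trigger character is not followed by its programmed keystroke.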

Soft Messaging Using Function Modification

In a further embodiment, the code section 1002 may include code that changes a library definition of one or more functions. For example, the code section 1002 could specify that a function named ‘add’ is to perform division and that a function named ‘subtract’ is to perform addition. The security processor 112 uses the library definitions to form a prediction of a response from a client device 102. The security processor 112 transmits the library definition through, for example, xml code, Java code, etc.

Upon receiving the code, the client device 102 applies the changed library definitions to the specified data fields in, for example, the result section 1004 of FIG. 10. In one instance, the client device 102 may prompt a user to enter a result of a mathematical equation as part of an authentication process or when entering a number of related transactions. A malicious application, attempting to alter the authentication or inject additional transactions, examines the response from the client device 102. The malicious application only sees, at most, the name of the function performed, not the definition of the function. As a result, the malicious application alters the data or applies transactions consistent with the name of the function. However, the security processor 112 is able to detect the malicious application because the received altered response would not be consistent with the functional definitions stored to the prediction.
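A minimal Python sketch of this check follows, using the redefinitions from the example above (‘add’ performs division and ‘subtract’ performs addition). The dictionary layout, the function names, and the operand values are hypothetical.

```python
import math
import operator

# Session-specific library definitions from the example above:
# the function names deliberately do not match their behavior.
SESSION_LIBRARY = {"add": operator.truediv, "subtract": operator.add}

# What an observer who sees only the function names would assume.
CONVENTIONAL = {"add": operator.add, "subtract": operator.sub}


def expected_result(name, a, b, library=SESSION_LIBRARY):
    """Result the prediction stores for calling function `name` on (a, b)."""
    return library[name](a, b)


def response_is_consistent(name, a, b, reported):
    """True when the reported result agrees with the session definitions."""
    return math.isclose(reported, expected_result(name, a, b))


# A legitimate client applies the session definition: 'add'(8, 2) -> 8 / 2.
legitimate = expected_result("add", 8, 2)
# An application guided by the name alone computes 8 + 2 instead.
guessed = CONVENTIONAL["add"](8, 2)
```

Because the guessed value follows the conventional meaning of the name, it disagrees with the stored definition and the response would be flagged.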

Soft Messaging Using Un-Rendered Page Elements

FIG. 11 shows client device 102 including rendered information 1102 and un-rendered information 1104 as a variation of soft messaging. The rendered information 1102 is displayed to a user by the client device 102 while the un-rendered information 1104 is not displayed but instead is included within source code of soft information for a document. The security processor 112 uses the un-rendered information 1104 to determine if a malicious application is affecting communications with the client device 102. For instance, the security processor 112 detects a malicious application if an altered response from the client includes reference to the un-rendered information 1104 or accommodates the un-rendered information 1104. While FIG. 11 shows the un-rendered information 1104 as titles, the un-rendered information 1104 may also include redundant/multiple passwords, redundant/multiple forms, or redundant/multiple logical structures in DOM.

Generally, malicious applications use un-rendered, machine-readable source code to perform functions instead of the rendered version of the code. The reason is that rendering the code takes additional time and resources that may expose the malicious application. In the example shown, soft information applied to the source code by the security processor 112 enables the introduction of title and tag variations, redundancies, substitutions, embedded requests for data downloads from arbitrary locations, logical obfuscations, piecewise delivery of a final edition of machine-readable source code, transformations of the machine-readable source code based on features of previous or currently rendered pages, transformations of the machine-readable source code based on intended client interactions with previous or currently rendered pages, etc. in the machine source code version of the page.

The soft modifications applied by the security processor 112 to the machine-readable source code produce a consistent, useable, non-varied rendered page to the intended user while producing a different varied page to the malicious application. In this manner, the intended user interacts freely with the rendered page while the attempts of the malicious application to interact with the un-rendered, machine-readable source results in a failure to interact with the source code. The un-rendered information 1104 may also cause the malicious application to experience excessively long task completion times.

Any modifications or alterations performed by a malicious application result in the activation of placeholder source page elements, which are processed and returned to the security processor 112 as indications that the returned information is based on an edition of the machine source code that was not the final edition intended for the end user. Additionally, the security processor 112 is able to detect that a malicious application altered a response from the client device 102 when the received information includes data with geographic locations or bogus data fields that correspond to the soft information of the un-rendered information 1104. For instance, the security processor 112 detects a malicious application if the response from the client device 102 includes a payee after the ‘Online Poker’ payee.
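One way to picture the un-rendered element check is the Python sketch below, which flags a response that populates placeholder fields or carries fields the page never defined. The field names and the function classify_response are hypothetical and stand in for whatever placeholder elements the security processor 112 places in the source code.

```python
# Fields the user can actually see and fill in (assumed names).
RENDERED_FIELDS = {"username", "password"}
# Placeholder fields present only in the page source (assumed names).
UNRENDERED_FIELDS = {"password_backup", "payee_2"}


def classify_response(fields):
    """Inspect a response (field name -> submitted value) for signs that it
    was built from the un-rendered source rather than the rendered page."""
    touched = sorted(name for name, value in fields.items()
                     if name in UNRENDERED_FIELDS and value)
    unknown = sorted(set(fields) - RENDERED_FIELDS - UNRENDERED_FIELDS)
    return {
        "placeholders_touched": touched,
        "unknown_fields": unknown,
        "suspicious": bool(touched or unknown),
    }


# A user who only saw the rendered page leaves placeholders empty.
user_result = classify_response(
    {"username": "alice", "password": "s3cret", "password_backup": ""})

# An application working from the source fills in a placeholder field.
bot_result = classify_response(
    {"username": "alice", "password": "s3cret", "payee_2": "ACME"})
```

In this sketch an empty placeholder is treated as untouched, which assumes the client returns un-rendered fields with empty values when the user does not interact with them.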

In addition to using data fields of un-rendered information 1104, the security processor 112 can also use behind-the-scenes, un-rendered, machine-readable source code used to generate communications. The security processor 112 may also use decision process interfaces for the intended client device 102 in technologies where the communications occur via physical medium and protocols other than HTTP traffic traveling through the network 110. Some of these communication examples include Short Message Service (“SMS”) messaging, manufacturing control process signals and protocols (e.g., Foundation Fieldbus, Profibus, or Hart Communication Protocol), and/or infrared or Bluetooth-based communications. The soft messaging techniques may be used by the security processor 112 when the delivery mechanism is not Internet/HTTP based as a way to differentiate between end user presentation, end user interface level and the machine source level of response, and/or interaction with delivered content or information.

In instances when a malicious application uses the interactions and/or input of a legitimate user via a client device 102 as a means to guide itself through the logical flow of the obfuscated, machine-readable source code, the security processor 112 may use soft information that includes the creation of additional “user” input events by the system. Examples of these user input events can include, but are not limited to, keyboard events, user focus events, mouse clicks, mouse rollovers, cursor movements, etc. The specific details of the user events generated by the soft information or messaging are known a priori to the security processor 112 as the prediction and may be later removed by server 104 or the security processor 112 to recover the intent of the legitimate client device 102 and/or end user.

Additionally, in instances when a malicious application exports machine-readable source code to be rendered for processing and/or navigation by a substitute recipient, the security processor 112 can use soft messaging variations among an operating system, a layout engine, a browser, Cascading Style Sheets (“CSS”), JavaScript, bugz, and/or peculiarities acting individually or in combination so that the exported source code compiles and/or renders differently for the substitute client than it does for the originally intended end user. The just-in-time nature of the delivery of the final edition of the machine-readable source code to the intended client device 102 also differentiates between page versions, content versions compiled, and/or rendered at the communicating client device 102. The communicating client device 102 may be the original, intended client or a substitute of the malicious application. The substitute client device may be a computer program and/or technology that replicates the intended end user's powers of observation, recognition and/or understanding.

Soft Messaging Using Graphical Elements

FIG. 12 shows a client device 102 conducting a transaction with a server 104. The transaction is displayed in datagram 1202 and includes three separate transactions totaling an amount of 268.55. In this example, a malicious application 304 intercepts the transmission of the datagram 1202 from the client device 102 to the server 104. The malicious application 304 uses channel noise to add a fourth transaction and a new balance of 332.89 to the datagram 1202. As a result, the server 104 receives four transactions and the correctly appearing balance of 332.89.

To prevent such fraud, the security processor 112 uses graphical elements 1204 as soft information to verify the data transmitted by the client device 102. The use of graphical elements 1204 enables the security processor 112 to validate channel communications when a client device 102 is the originator of hard and soft information. In other words, the security processor 112 uses graphical elements 1204 to confirm communications with the client device 102 when the security processor 112 may not be able to form a prediction because the client device 102 is the originator of the soft and/or hard information. The graphical elements 1204 may be presented to the user of the client device 102 as, for example, a banner, background, image, part of an advertisement, or a video. In some examples, the security processor 112 can use variations in graphical elements 1204 as soft information in conjunction with other soft messaging techniques discussed above.

In the illustrated example of FIG. 12, the security processor 112 transmits the graphical element 1204 to the client device 102. The security processor 112 embeds the balance information as code included within the graphic, which helps prevent the malicious application 304 from detecting and using channel noise to alter the balance to the amount provided originally by the client device 102. The client device 102 accordingly displays the graphical element 1204 including the balance received by the server 104. The user can then compare the balances and provide feedback that the balances do not match by, for instance, selecting the graphical element 1204. In response, the security processor 112 instructs the server 104 to disregard the datagram 1202.

In an alternative embodiment, the security processor 112 enables the client device 102 to supply comparison information. For example, a ‘submit these transactions’ button may be presented by the client device 102 as an active, account balance indexed grid. A user of the device 102 is expected to activate that portion of the button corresponding to the traditionally displayed account balance. As in the previous examples, the details of this button may be session dependent.

In another example, the client device 102 may be enabled by the security processor 112 to send a screen capture of the account information in the datagram 1202 to the server 104 for automated comparison by the security processor 112. The background and other features of the screen capture may be session dependent to prevent counterfeiting. For example, the security processor 112 may specify in soft messaging whether the client device 102 is to create and forward a snapshot of the top ⅔ of an account balance or the lower ⅔ of the account balance and/or a blank image followed by the account balance.

Multiple Predictions For a Single Session Embodiment

FIG. 13 shows an illustration of two different configurations of a client device 102 that can be accounted for by the security processor 112 to create multiple predictions in some embodiments. In this example, the security processor 112 creates two different predictions based on an orientation of client device 102. The first prediction corresponds to the client device 102 being in a vertical orientation 1302 and the second prediction corresponds to the client device being in a horizontal orientation 1304.

Many smartphones and tablet computers display information based on how the device is oriented. However, the orientation of the device is generally not reported back to a server 104 through backscattered secondary information. As a result, the server 104 does not know the orientation of the device when the hard information is displayed. To compensate for this lack of information, the security processor 112 creates two different predictions. In some embodiments, the security processor 112 may generate, by default, multiple predictions regardless of a type of client device 102 to account for different screen sizes, orientations, etc. In other embodiments, the security processor 112 may generate a second prediction only after receiving backscatter information that indicates the client device 102 corresponds to a type of device that can have more than one orientation.

In the illustrated example of FIG. 13, the security processor 112 creates a first prediction as to how the hard information (e.g., username, password and ‘submit’ button) is displayed based on the received soft information. The security processor 112 determines that coordinates of the features displayed by client device 102 have to fit within the vertical orientation 1302 of the client device 102. Similarly, the security processor 112 creates a second prediction as to how the hard information will be displayed within the horizontal orientation 1304. The differences between the orientations 1302, 1304 can include spacing between data fields, sizes of the data fields, location of the ‘submit’ button, and a location of the trademark. The security processor 112 then compares a response from the client device 102 to each of the predictions to determine if a malicious application is affecting communications.
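Comparing a response against each of several predictions can be sketched in Python as shown below. The coordinates, the tolerance, the dictionary layout, and the function name matching_orientations are all assumed for illustration.

```python
import math

# One predicted layout per orientation: element name -> (x, y) in pixels.
# All coordinates are hypothetical.
PREDICTIONS = {
    "vertical": {"submit": (160, 420)},
    "horizontal": {"submit": (400, 220)},
}


def matching_orientations(reported, predictions=PREDICTIONS, tolerance=10.0):
    """Return the orientations whose predicted layout agrees with the
    reported positions to within `tolerance` pixels for every element."""
    matches = []
    for orientation, layout in predictions.items():
        if all(name in reported
               and math.dist(reported[name], xy) <= tolerance
               for name, xy in layout.items()):
            matches.append(orientation)
    return matches


# A response consistent with the horizontal prediction.
landscape = matching_orientations({"submit": (402, 223)})
# A response that fits neither prediction.
neither = matching_orientations({"submit": (160, 500)})
```

Under this sketch a response that matches at least one prediction would be accepted, while an empty result would be treated as an indication that a malicious application may be affecting communications.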

Prediction Data Structure

FIG. 14 shows a diagram of a data structure 1400 of a prediction formed by the security processor 112 based on soft information and secondary information acquired from global and local observable temporal channel information. The data structure 1400 is representative of information used by the security processor 112 to form the prediction. In other embodiments, the security processor 112 may render a webpage based on the soft and secondary information, similar to the datagram 510 of FIGS. 6 and 7.

The example security processor 112 uses the information in data structure 1400 to determine if a response from a client device 102 is indicative of a malicious application affecting communications. The security processor 112 creates the data structure 1400 by storing soft information used in soft messaging by a server 104. The security processor 112 supplements the data structure 1400 with secondary information received as backscatter information. As mentioned before, the soft information describes how hard information is displayed or presented while the secondary information provides indications of how the soft and hard information are to be displayed on a client device 102.

In the illustrative example of FIG. 14, the soft information includes font type, font size, and positioning of three text fields. The soft information also includes coordinates of a ‘submit’ button including an allowable deviation or predetermined threshold. The soft information further includes programmed text to be generated automatically in the text fields and a location of a banner graphical element. In addition, the soft information includes un-rendered text at specified coordinates.

Also in the data structure 1400 of FIG. 14, the secondary information includes a browser type and operating system of the client device 102. The secondary information also includes an indication that JavaScript is enabled. The security processor 112 uses the secondary information to modify the soft information as needed. For example, upon receiving an indication that a client device 102 is using an OPPS browser, the security processor 112 updates coordinates of the text fields and ‘submit’ button to reflect how the OPPS browser is known to format and render text and graphics. In this manner, the secondary information is used by the security processor 112 to refine or alter the initial prediction made when the soft information was initially transmitted to the client device 102.
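The prediction data structure of FIG. 14 and its refinement by secondary information might be modeled as in the Python sketch below. The class name Prediction, its fields, the BROWSER_OFFSETS table, and the offset values are hypothetical; the actual rendering adjustments for any browser are not given in this description.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Prediction:
    """Soft information plus secondary information for one session
    (a simplified, assumed subset of the fields described above)."""
    font_sizes: dict            # field name -> font size in points
    submit_xy: tuple            # predicted 'submit' button coordinates
    submit_tolerance: float     # allowable deviation in pixels
    browser: str = "unknown"    # secondary information
    javascript_enabled: bool = False


# Hypothetical per-browser rendering offsets (dx, dy) in pixels.
BROWSER_OFFSETS = {"OPPS": (0, 6)}


def refine(prediction, browser, javascript_enabled):
    """Return a new prediction updated with backscattered secondary
    information; the initial prediction is left unchanged."""
    dx, dy = BROWSER_OFFSETS.get(browser, (0, 0))
    x, y = prediction.submit_xy
    return replace(prediction,
                   submit_xy=(x + dx, y + dy),
                   browser=browser,
                   javascript_enabled=javascript_enabled)


initial = Prediction(
    font_sizes={"username": 12, "password_1": 13, "password_2": 14},
    submit_xy=(200, 340),
    submit_tolerance=10.0,
)
refined = refine(initial, browser="OPPS", javascript_enabled=True)
```

Keeping the initial prediction immutable and deriving a refined copy mirrors the description above, in which the prediction made at transmission time is later adjusted once secondary information arrives.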

Flowchart of the Example Process

FIGS. 15 and 16 are a flow diagram showing example procedures 1500, 1530, and 1560 to validate a communication channel, according to an example embodiment of the present invention. Although the procedures 1500, 1530, and 1560 are described with reference to the flow diagram illustrated in FIGS. 15 and 16, it will be appreciated that many other methods of performing the acts associated with the procedures 1500, 1530, and 1560 may be used. For example, the order of many of the blocks may be changed, certain blocks may be combined with other blocks, and many of the blocks described are optional.

The example procedure 1500 operates on, for example, the client device 102 of FIGS. 1 to 6. The procedure 1500 begins when the client device 102 transmits a connection request to a server 104 (block 1502). The connection request can include a website address or IP address that is routed by the network 110 to the appropriate server 104. The connection request can also include device information identifying secondary characteristics or information associated with the client device 102.

After receiving a connection response, the client device 102 requests to engage in a data transaction with the server 104 (block 1504). The request can include a specification of information that the client device 102 desires to read or write to information stored in a database or managed by the server 104. The request can also include one or more transactions that the client device 102 desires to complete with the server 104.

Some time after transmitting the request, the client device 102 receives hard and soft information 1507 corresponding to the requested transaction (block 1506). The hard and soft information 1507 can be received in separate messages or combined together in one or more messages. The client device 102 uses the soft information to determine how the hard information is to be rendered and displayed (block 1508). After displaying the hard information, the client device 102 transmits a response message 1509 provided by a user (block 1510). At this point, the example procedure 1500 ends when the client device 102 and server 104 stop exchanging communications (e.g., terminate a communication session). Additionally, in some embodiments, the client device 102 may receive an indication from the server 104 that a malicious application has affected at least the information in the response message 1509. As a result, the client device 102 could re-authenticate communications with the server 104 or enter a failsafe mode.

The example procedure 1530 of FIG. 15 operates on, for example, the application server 104 of FIGS. 1 to 6. The procedure begins when the server 104 receives a connection request from a client device 102 (block 1532). In instances that the connection request includes device information, the server 104 transmits the device information to a communicatively coupled security processor 112. The server 104 then transmits a connection response to the client device 102, thereby initiating a communication session (block 1534).

Some time later, the server 104 receives from the client device 102 a request to process a data transaction (block 1536). The server 104 then determines hard information 1537 associated with the requested data transaction (block 1538). For example, a request to access an account causes the server 104 to identify account log-in information. In another example, a request to perform a banking transaction causes the server 104 to identify account information and available banking options for the account. The server 104 then transmits the determined hard information 1537 to a security processor 112. In some embodiments, the security processor 112 may be instantiated within the server 104. In other embodiments, the security processor 112 may be remote from the server 104.

Responsive to receiving hard and soft information 1507 from the security processor 112, the server 104 formats and transmits the information 1507 to the client device 102 (block 1540). In some embodiments, the server 104 receives messages with combined hard and soft information. In these embodiments, the server 104 formats the messages (e.g., structures the messages into data packets) for transmission. In other embodiments, the server 104 receives the hard and soft information separately. In these other embodiments, the server 104 combines the hard and soft information into one or more messages and formats these messages for transmission. The server 104 accordingly provides the client device 102 with hard and soft messaging.

After transmitting the hard and soft information 1507, the server 104 of FIG. 16 receives backscattered information 1543 from channel components used to process, route, and render the information 1507 (block 1542). The server 104 transmits this backscattered information 1543 to the security processor 112. In some instances, the server 104 transmits the information 1543 as the information is received. In other instances, the server 104 transmits the information 1543 periodically or after receiving an indication that the soft and hard information 1507 has been received and processed by the client device 102.

The server 104 then receives the response message 1509 from the client device 102 including information responding to the hard information (block 1544). The server 104 subsequently transmits the response message 1509 to the security processor 112. After the security processor 112 has compared information in the response message 1509 to a prediction, the server 104 determines whether the communication session with the client device has been validated (block 1546). If the security processor 112 does not provide an indication of a malicious application, the server 104 determines the communication session with the client device 102 is validated. The server 104 continues communications with the client device 102 and continues to validate communications until the communication session is ended.

However, responsive to the security processor 112 providing an indication of a malicious application, the server 104 enters a failsafe mode (block 1548). The failsafe mode can include the server 104 informing the client device 102 of the malicious application, requesting that the client device 102 re-authenticate, restricting access to the data transactions associated with the client device 102, transmitting an alarm or alert to appropriate personnel, and/or applying a routine or algorithm to remove or restrict further attempts by the malicious application to affect communications. Regardless of which failsafe operation is performed, the example procedure 1530 ends when the communication session with the client device 102 is terminated or when the effects of the malicious application have been remedied.

Returning to FIG. 15, the example procedure 1560 operates on, for example, the security processor 112 of FIGS. 1 to 6. The procedure 1560 begins when the security processor 112 receives device information from the server 104 (block 1562). This step can be skipped in instances where a connection request does not include device information. The security processor 112 then receives hard information 1537 from the server 104 and identifies compatible soft information (block 1564). For instance, hard information has a limited number of ways that it can be correctly displayed. The security processor 112 uses this relationship to identify which soft information is compatible with the hard information.

After identifying the compatible soft information, the security processor 112 selects a variation of the soft information (block 1566). The security processor 112 may select a different variation of soft information for each client device-server connection. As described before, this variation prevents malicious applications from reverse engineering the soft messaging used to validate communications. The security processor 112 then combines the hard information and the selected soft information 1507 into one or more messages and transmits combined information 1507 to the server 104, which then transmits the information 1507 to the client device 102 (block 1568). The security processor 112 also forms a prediction as to how the client device 102 will render and display the hard information based on the soft information (block 1570).
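One way the per-connection selection of a variation could be carried out is sketched below, as an assumption-laden example only. The list `VARIATIONS`, its contents, and the function `select_variation` are hypothetical. The sketch picks a variation unpredictably for each new session and stores it so that the prediction can later be formed from the same variation; it does not guarantee that two sessions never receive the same variation.

```python
import secrets
from typing import Dict, List

# Hypothetical variations of soft information that are all compatible
# with the same hard information; the values are illustrative only.
VARIATIONS: List[Dict[str, object]] = [
    {"font": "serif", "font_size": 12, "submit_xy": (100, 320)},
    {"font": "sans-serif", "font_size": 11, "submit_xy": (140, 300)},
    {"font": "monospace", "font_size": 10, "submit_xy": (90, 350)},
]


def select_variation(session_id: str,
                     selected: Dict[str, Dict[str, object]]) -> Dict[str, object]:
    """Choose a variation for a new session and remember the choice."""
    if session_id not in selected:
        # secrets.choice draws from a cryptographically strong source,
        # so the choice is hard for an observer to anticipate.
        selected[session_id] = secrets.choice(VARIATIONS)
    return selected[session_id]


sessions: Dict[str, Dict[str, object]] = {}
first = select_variation("session-a", sessions)
again = select_variation("session-a", sessions)  # same session, same variation
```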

In FIG. 16, the security processor 112 receives the backscattered information 1543 from the server 104 and determines corresponding secondary information or characteristics (block 1572). The security processor 112 then updates or modifies the prediction based on the secondary information (block 1574). Responsive to receiving the response message 1509 from the client device 102, the security processor 112 compares the information in the response to the prediction (block 1576). The comparison includes determining if soft information returned by the client device 102 matches, or is within an allowable deviation of, the corresponding soft information in the prediction (e.g., matching coordinates of graphics or data fields, matching programmatically entered characters, or matching font information) (block 1578).
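The comparison of blocks 1576 and 1578 may be illustrated, under simplifying assumptions, by the following sketch. It assumes the response reports one coordinate pair per element and that the prediction stores a coordinate pair with an allowable deviation; the function names and data layout are hypothetical.

```python
from typing import Dict, Tuple

Point = Tuple[int, int]


def within_deviation(reported: Point, predicted: Point, tolerance: int) -> bool:
    """True if each reported coordinate is within the tolerance of the prediction."""
    return (abs(reported[0] - predicted[0]) <= tolerance
            and abs(reported[1] - predicted[1]) <= tolerance)


def response_matches(response: Dict[str, Point],
                     prediction: Dict[str, Tuple[Point, int]]) -> bool:
    """Compare every predicted element against the returned soft information."""
    for name, (predicted, tolerance) in prediction.items():
        reported = response.get(name)
        # A missing element is treated as a deviation in this sketch.
        if reported is None or not within_deviation(reported, predicted, tolerance):
            return False
    return True
```

For example, with a predicted ‘submit’ button at (104, 318) and an allowable deviation of 5, a reported position of (106, 315) would match, while (120, 318) would be treated as a deviation.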

Responsive to determining the information in the response matches the prediction, the security processor 112 validates the communication session between the server 104 and the client device 102 (block 1580). The security processor 112 then continues to validate the communication session for additional communications between the server 104 and the client device 102 until the communication session is ended. Responsive to determining the information in the response deviates from the prediction, the security processor 112 provides an indication of a malicious application (block 1582). The security processor 112 may also remedy the effects of the malicious application or take steps to prevent the malicious application from affecting further communications between the client device 102 and the server 104. The security processor 112 then continues to validate the communication session for additional communications between the server 104 and the client device 102 until the communication session is ended.

Remote Access Embodiment

Known network security devices operating in relation to network switches, network routers, Internet routers, network controllers, etc. are configured to monitor IP traffic flows (e.g., the flow of data packets) into a network or out of a network for suspicious activity, malicious applications, and/or malware servers. The network security devices determine connection information by monitoring the IP traffic flow. The connection information may include source and destination IP addresses, source and destination port numbers, packet sizes, and header information. The network security devices use the connection information to monitor the IP traffic flows for IP addresses or ports (or even packet content) associated with known malicious applications. As addresses or other endpoints of the malware servers are discovered and broadcast (or made publicly available), network operators update network devices to restrict IP traffic flows associated with these malicious devices. The restriction may include blocking traffic that originates from an identified network endpoint or data packets with a certain IP address.

It should be appreciated that the success of endpoint or IP address blocking is based on actually knowing beforehand the IP address or endpoint that is associated with a malicious or malware server. This is not always the case, especially for relatively new malicious endpoints or servers. Further, malware engineers have developed peer-to-peer types of distributed command and control systems that make identification of a malicious device significantly more difficult because the malicious device operates behind a virtual curtain of seemingly friendly or normal devices.

Historically, sensitive traffic flows (e.g., financial traffic) have been encrypted to prevent unauthorized users from viewing/stealing/manipulating/altering data within the data packets. However, encryption makes monitoring IP traffic flows and datagrams of the data packets more difficult because the contents of the flows cannot be determined (or easily determined). More recently, many other traffic flows are also being encrypted over concerns of privacy. For instance, even simple web searches are encrypted. This encryption trend makes it increasingly difficult to monitor and analyze IP traffic flow details/contents and accordingly defend networks from attack. Thus, as more IP traffic becomes encrypted and more malicious devices operate in a peer-to-peer configuration, known methods of monitoring IP traffic flows are becoming outdated or obsolete.

As discussed above, malware servers may use peer-to-peer distributed command and control systems to achieve a specific malicious purpose. Remote access toolkits, such as Poison Ivy and Dark Comet, are configured to enable an attacker using a malware computer or server to steal authentication credentials from a victim's computer or smartphone. The remote access toolkits enable the attacker to later use the stolen credentials in conjunction with remote control of the victim's own computer (e.g., endpoint) to gain access to sensitive and protected information at the victim's computer or hosted by a third-party (e.g., a bank or other content provider). From the viewpoint of known network security devices and network-level defensive technologies, the network connection data reveals that the connection (e.g., access to a webpage) to the third-party originates from an approved client endpoint device with appropriate geolocation and other identifying machine data (e.g., a MAC address or kernel-level information).

In this example, an external examination of the network traffic flow into and out of the approved client endpoint would likely reveal multiple active connections during the session or connection to one or more third-party servers. Most of these active connections are legitimate, such as a connection to a streaming music/video content provider, a connection for a chat session, an email connection, connections to webpages of other third-party content providers, connections opened by background applications or widgets, etc. Mere simultaneity in the time of a certain connection to, for example, a banking website with another network service does not on its own indicate that an endpoint has been compromised and is under the influence of a remote access user or malicious server/device/application. Further, white listing of known and trusted service IP addresses in one or more data structures does not enable a network device to entirely narrow down whether an IP traffic flow includes data packets to and/or from a remote malicious device. There are thousands to millions of IP endpoints that are safe but that may not be independently verified as being legitimate. In addition, a malicious server may very quickly acquire or use a server or other device associated with a white listed IP address. Accordingly, there are no guarantees that a known trusted server or other endpoint device itself has not been compromised, either directly or through control of a network device along a routing path of service traffic. It should be appreciated then that examining IP data packet traffic in depth for proof of a service type may be helpful in isolating malicious IP traffic. However, as discussed above, more IP traffic is encrypted, making such verification difficult.

The example method, apparatus, and system discussed herein are configured to detect malicious peer-to-peer distributed command and control systems by monitoring and influencing activity within a client browser application or API while a connection is in progress. The example method, apparatus, and system provide details of this in-session activity, which may be signaled via IP packet traffic to a monitoring network traffic control appliance (e.g., software-defined network (“SDN”) enabled switch and/or controller). IP traffic separate from in-session traffic may be controlled (e.g., queued, throttled, resent, dropped, etc.) by the network switch during the session. The example method, apparatus, and system determine how controlling of the other IP traffic affects the in-session activity of the browser and send the observed in-session browser activity to the network switch. In other words, the example method, apparatus, and system intentionally slow or stop IP traffic not associated with an in-session connection to a website, for example, to determine if the control affects the in-session connection. Such an arrangement identifies malicious peer-to-peer distributed command and control systems that use out-of-session traffic to relay captured screen shots of a target or victim device and keystrokes or other inputs between the malicious/malware server and the victim's computer.

The example method, apparatus, and system use the in-session signaling to determine if there is indeed a malicious peer-to-peer distributed command and control system manipulating a victim's computer, thereby thwarting any attempts to disguise the control data through encryption or posing as another service. The example method, apparatus, and system may use the signaling to determine, for example, that it is taking a user longer to provide a password, enter a numerical value, or select a hyperlink after the out-of-session traffic was slowed or otherwise controlled. This indicates, for example, that a malicious peer-to-peer distributed command and control system was using the out-of-session IP traffic to provide the in-session information.

The in-session signaling includes transmitting from a victim's computer signaling packets that are distinct from the IP data packets being communicated between the computer and the malicious server and/or a third-party application server. The signaling packets may include secondary channel packets created by, for example, an application operating in conjunction with a website displayed by the victim's computer. The signaling packets are configured to communicate in-session information indicative of an operational state or activity within a web browsing application displaying the website or webpage. It should be appreciated that such in-session information is not detectable by external network components such as switches and controllers because the information (if it is transmitted at all) is usually encrypted.

FIG. 18 shows a diagram of a network communication system 1800 configured to detect a malicious peer-to-peer distributed command and control system 1802 (e.g., a malicious server or application), according to an example embodiment of the present disclosure. The example communication system 1800 includes an application server 1804, which may include any third-party content provider server, database server, etc. The application server 1804 is configured to host or operate at least one service via an API, website, database, etc. The example communication system 1800 also includes network switches 1806 a and 1806 b and network router 1808 configured to route IP traffic including IP data packets across a network 110 between the application server 1804 and client devices 102 a, 102 b, and 102 c.

Each of the network switches 1806 a and 1806 b is communicatively coupled to and/or includes a respective controller 1810 (e.g., an SDN controller). The controllers 1810 are configured to operate as a strategic control point in a network for relaying control information between switches/routers via one or more APIs. Each of the controllers 1810 may include the security processor 112 of FIGS. 1 to 17. Alternatively, the controllers 1810 may be programmed with instructions that correlate signaling from a webpage into one or more instructions provided to the switches 1806.

In the illustrated example, the client device 102 c accesses a webpage 1812 of a website hosted or operated by the application server 1804. The malicious server 1802 (e.g., a dark comet server) uses a peer-to-peer distributed command and control system to remotely access the client device 102 c (e.g., a dark comet client), as denoted by communication path 1814. For instance, the malicious server 1802 may have instructed the client device 102 c to access the application server 1804 via a web browsing application operating on the client device 102 c. The in-session connection between the client device 102 c and the application server 1804 is denoted by communication path 1816. It should be appreciated that the malicious server 1802 uses the out-of-session communication path 1814 to communicate in-session with the application server 1804 via the path 1816 by using the endpoint of the client device 102 c as a cover.

The malicious server 1802 causes the client device 102 c to record screen shots, keystrokes and other inputs by a user in addition to information (e.g., a copy of the webpage) regarding the webpage 1812. The client device 102 c transmits this information to the malicious server 1802 via communication path 1814. Additionally, the malicious server 1802 transmits instructions and other information to the client device 102 c, which are relayed to the application server 1804 via the in-session communication path 1816. The instructions may include information typed into fields of a webpage, webpage navigation commands, object manipulation, etc. It should be appreciated that a user of the client device 102 c may or may not be present while the malicious server 1802 is accessing or using the client device 102 c.

It should be appreciated that the client device 102 c may also be communicating with another application server 1820, which is outside of the session with the application server 1804. Collectively, the communications between the client device 102 c and the application server 1804 via the communication path 1816 may be referred to as first data packets, and communications between the client device 102 c and all other servers, including the application server 1820 and the malicious server 1802, may be referred to as second data packets.

In this example, the network switches 1806 are configured to route the first data packets between the client device 102 c and the application server 1804. The network switch 1806 is also configured to monitor the source and destination addresses of the communications from the client device 102 c and designate the client device 102 c as a protected device conditioned on determining that the application server 1804 accessed by the client device 102 c is included within a data structure of specified endpoints (e.g., endpoints of financial institutions) or belongs to a pre-specified set of endpoints. In other words, the network switches 1806 are coded or configured to detect data packets with a destination address equal to the address of the application server 1804. An operator of the application server 1804 may request (after verification) to be included within the set. Alternatively, an operator of the security processor 112 or the security processor 112 itself may determine that an IP address of the application server 1804 is to be included within the set. Accordingly, when the switch 1806 detects a data packet from the client device 102 c with a destination of the application server 1804, the IP address of the source (e.g., the client device 102 c) becomes a protected address. In some instances, the security processor 112 and/or the controller 1810 may make the designation. Additionally, in some instances, the switches 1806 may communicate (in conjunction with controllers 1810) to determine which is closer along the communication path 1816 to the client device 102 c to determine which should perform the above described actions.
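The designation of a protected address described above may be sketched as follows, purely as an example. The set `PROTECTED_ENDPOINTS`, the `Packet` tuple, and the function `update_protected` are hypothetical names, and the IP addresses are drawn from ranges reserved for documentation.

```python
from typing import NamedTuple, Set


class Packet(NamedTuple):
    src: str  # source IP address
    dst: str  # destination IP address


# Hypothetical data structure of specified endpoints
# (e.g., application servers of financial institutions).
PROTECTED_ENDPOINTS: Set[str] = {"203.0.113.10"}


def update_protected(packet: Packet, protected_clients: Set[str]) -> None:
    """Mark the source as protected when it contacts a specified endpoint."""
    if packet.dst in PROTECTED_ENDPOINTS:
        protected_clients.add(packet.src)


protected: Set[str] = set()
update_protected(Packet("192.0.2.5", "203.0.113.10"), protected)  # listed endpoint
update_protected(Packet("192.0.2.6", "198.51.100.7"), protected)  # unlisted endpoint
```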

The example switches 1806 may also inspect traffic to and/or from the client device 102 c for IP addresses and ports. Any IP addresses and/or ports that do not correspond to the application server 1804 or the client device 102 c are designated as ‘other devices’ and the corresponding IP traffic is designated as second data packets. The switch 1806 may prepare to filter or otherwise control these second data packets based on signaling from within the client device 102 c session with the application server 1804.
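The separation of traffic into first and second data packets may be illustrated by the sketch below, which is an assumption-based example rather than a description of any particular switch. It considers IP addresses only (ports are omitted for brevity), and the names `classify_flow` and `other_devices` are hypothetical.

```python
from typing import Iterable, Set, Tuple

Flow = Tuple[str, str]  # (source IP, destination IP)


def classify_flow(flow: Flow, client: str, app_server: str) -> str:
    """Label a flow as 'first', 'second', or 'unrelated' for a protected client."""
    src, dst = flow
    if client not in (src, dst):
        return "unrelated"
    peer = dst if src == client else src
    return "first" if peer == app_server else "second"


def other_devices(flows: Iterable[Flow], client: str, app_server: str) -> Set[str]:
    """Collect addresses of devices exchanging second data packets with the client."""
    others: Set[str] = set()
    for src, dst in flows:
        if classify_flow((src, dst), client, app_server) == "second":
            others.add(dst if src == client else src)
    return others
```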

In the illustrated example, the website operated by the application server 1804 may include a plug-in application, object, widget, etc. that includes computer-readable security instructions 1822 for relaying information from within the session regarding activity of the web browser. The instructions 1822 are configured to, for example, operate an application that determines when the webpage 1812 is about to be rendered and/or within the final stages of being rendered or displayed. The instructions 1822 are also configured to operate an application that determines the internal state of the web browsing application accessing the website of the application server and transmit a signal packet to the switch 1806 indicative of the rendering and/or the internal state.

The example instructions 1822 are also configured to operate an application that detects or determines when information is being provided into one or more fields of the webpage, either by a user of the client device 102 c or via the malicious server 1802. For example, the application defined by the instructions 1822 may detect when a username is being entered into a username field or when a password is being entered into a password field. The instructions 1822 may, for example, include identifiers of the field names of note, including any variation caused by the security processor 112 of FIGS. 1 to 17. The example application defined by the instructions 1822 may also note other operations of the webpage or website including mouse clicks, mouse movements, object selections, hyperlink selections, etc. Further, while password and username fields were mentioned above, the types of information that may be monitored are virtually limitless. For example, dollar amount fields may be monitored.

The example application defined by the instructions 1822 is configured to transmit a signaling message that indicates some manipulation of a webpage is occurring (e.g., typing of a username). As the username, for example, is being typed (or after the username is typed), the application defined by the instructions 1822 sends an indication of the action occurring within the signaling message. The example application defined by the instructions 1822 is configured to transmit another signaling message that indicates further manipulation of a webpage is occurring (e.g., typing of a password). It should be appreciated that the number of signaling messages created and transmitted by the application defined by the instructions 1822 is virtually limitless and may occur for each webpage of the related website. As discussed in more detail below, the switch 1806 and/or the controller 1810 are configured to use the signaling messages (and/or the timing of the signaling messages) to determine a presence of the malicious server 1802.

The example switch 1806 and/or the controller 1810 are configured to use the signaling from the application defined by the instructions 1822 to determine how user inputs change when out-of-session IP traffic is controlled. The switch 1806 and/or the controller 1810 receives the signal packets from the client device 102 c and performs specified or coded routines based on the content of the packets. For example, controller 1810 may determine that a signaling packet includes an indication that the webpage 1812 is being rendered. In response, the controller 1810 may operate in conjunction with the switch 1806 to determine the IP addresses and/or ports of the second data packets being transmitted to and/or transmitted from the client device 102 c. Additionally, the controller 1810 may determine that a signaling packet includes a first indication that information is being entered into a field of the webpage 1812 (e.g., a username is being entered into the webpage). In response, the controller 1810 instructs the switch 1806 to control, throttle, delay, or otherwise queue the second data packets. Further, the controller 1810 may determine that another signaling packet includes a second indication that information is being entered into a field of the webpage 1812 (e.g., a password is being entered into the webpage).
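The routines performed in response to the content of the signal packets may be sketched as a simple mapping from observed events to switch instructions. The event names and instruction strings below are hypothetical placeholders; an actual controller would issue instructions through whatever interface the switch provides.

```python
from typing import List


def instructions_for(signal_events: List[str]) -> List[str]:
    """Map a sequence of signal-packet events to instructions for the switch."""
    instructions: List[str] = []
    throttling = False
    for event in signal_events:
        if event == "page_rendering":
            # Determine IP addresses and/or ports of the second data packets.
            instructions.append("identify_second_packet_endpoints")
        elif event == "field_entry" and not throttling:
            # First indication of data entry (e.g., a username).
            instructions.append("throttle_second_packets")
            throttling = True
        elif event == "field_entry":
            # Second indication (e.g., a password); its timing is examined next.
            instructions.append("evaluate_follow_up_timing")
    return instructions
```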

The example controller 1810 is configured to compare the signaling packets to determine if there was an effect from controlling the second data packets. For instance, receiving a password signaling packet after a specified duration or time period may indicate that the control of the second data packets caused the malicious server 1802 to take longer to provide the password. The specified duration may range from ¼ second to 10 minutes, but is more preferably between 10 seconds and 60 seconds. In some instances, the follow-up signaling packet may never be received, which is also an indication of the malicious server 1802. However, if the second, or follow-up, signaling packet is received within an appropriate amount of time, the controller 1810 is configured to instruct the switch 1806 to resume normal routing of the second data packets.
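The timing decision described above may be sketched as follows. The function name, the returned strings, and the default window of 30 seconds (chosen from within the 10 to 60 second range mentioned above) are assumptions made for illustration.

```python
from typing import Optional


def evaluate_follow_up(throttle_start: float,
                       follow_up_time: Optional[float],
                       window_seconds: float = 30.0) -> str:
    """Decide what to do once throttling of the second data packets has begun.

    follow_up_time is None when no follow-up signal packet was received.
    Times are in seconds on a common clock.
    """
    if follow_up_time is None:
        return "indicate_malicious_device"
    if follow_up_time - throttle_start > window_seconds:
        return "indicate_malicious_device"
    return "resume_normal_routing"
```

For example, a follow-up signal packet arriving 8 seconds after throttling begins would lead to normal routing being resumed, whereas a packet arriving after 45 seconds, or never arriving, would lead to an indication of a malicious device.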

In alternative embodiments, the controller 1810 may cause multiple throttling or controlling of the second data packets. For example, the controller 1810 may cause a first brief interruption of routing of second data packets during a username being entered into a field on the webpage 1812, a second brief interruption during a password being entered, and third or more brief pauses as payment information is entered into bill pay pages. Under such a scenario, the controller 1810 is configured to cause each of the first to third brief interruptions of the routing of the second data packets to have a duration of about ½ second, for example. The controller 1810 is configured to use these multiple interruptions to rule out false alarms by operating the short, brief interruptions many times and determining if there is a correlation between the interruptions and stoppages in the typing, thereby indicating that the malicious server 1802 is remote controlling a browser on the client device 102 c.
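One simplified way to look for a correlation between the brief interruptions and stoppages in typing is sketched below. It assumes that signal packets carry keystroke timestamps in milliseconds and that a stoppage means no keystroke was signaled during an interruption; the function names and the 0.8 threshold are hypothetical.

```python
from typing import Sequence


def stalled_fraction(interruptions_ms: Sequence[int],
                     keystrokes_ms: Sequence[int],
                     duration_ms: int = 500) -> float:
    """Fraction of interruptions during which no keystroke was signaled."""
    if not interruptions_ms:
        return 0.0
    stalled = sum(
        1 for start in interruptions_ms
        if not any(start <= t < start + duration_ms for t in keystrokes_ms)
    )
    return stalled / len(interruptions_ms)


def remote_control_suspected(interruptions_ms: Sequence[int],
                             keystrokes_ms: Sequence[int],
                             duration_ms: int = 500,
                             threshold: float = 0.8) -> bool:
    """Flag remote control when typing stalls in most interruption windows."""
    return stalled_fraction(interruptions_ms, keystrokes_ms, duration_ms) >= threshold
```

Because a local user may also pause by coincidence, a sketch of this kind is only meaningful when the interruptions are repeated many times, consistent with the use of multiple interruptions to rule out false alarms.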

In some instances, the controller 1810 may compare the IP addresses of the second data packets to a data structure that includes white or approved IP addresses. In this instance, the controller 1810 may instruct the switch 1806 to control second data packets that are not related to an approved IP address. Such a configuration reduces the number of second data packets that are throttled or otherwise modified to only second data packets received from or communicated to IP addresses that are not on an approved list.
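The filtering against approved IP addresses may be sketched with the standard `ipaddress` module of Python. The approved list is assumed here to be expressed as network prefixes, which is an illustrative choice; the function names and addresses are hypothetical.

```python
import ipaddress
from typing import Iterable, List


def is_approved(ip: str, approved_networks: Iterable[str]) -> bool:
    """True if the address falls inside any approved network prefix."""
    addr = ipaddress.ip_address(ip)
    return any(addr in ipaddress.ip_network(net) for net in approved_networks)


def addresses_to_throttle(other_devices: Iterable[str],
                          approved_networks: List[str]) -> List[str]:
    """Return the other-device addresses that are not on the approved list."""
    return sorted(ip for ip in set(other_devices)
                  if not is_approved(ip, approved_networks))
```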

It should be appreciated that the controller 1810 and/or the security processor 112 operating on the controller 1810 may be configured to compare not only the timing of the signaling packets but also their content. For example, a first signaling packet may indicate a username being entered. The example controller 1810 may then receive subsequent signaling packets associated with mouse movement or hyperlink selection. However, the controller 1810 may enable the switch 1806 to resume normal routing only if the signaling packet associated with a password being entered is received. It is contemplated that such actions are usually performed in conjunction by a user and there is a high confidence or probability that signaling packets associated with these actions should be received within a short time period. In another example, a first signaling packet may mark the beginning of a mouse movement and a second signaling packet being searched for by the controller 1810 corresponds to the end of the mouse movement. The expectation is that the movement of the mouse or cursor is constant regardless of control of the second data packets. Accordingly, the purpose or goal of the controller 1810, the switch 1806, and/or the security processor 112 operating on the controller 1810 is to determine if there is repeatable evidence of a cause-effect relationship between in-session web browsing client activity (or database access or API access) of the webpage 1812 and out-of-session second data packets.

The example controller 1810 and/or the switch 1806 may also add and/or modify a header field of the second data packets to provide an indication, to the security server 112, of the multi-tenancy state of the client device 102 c. The controller 1810 may, for example, change a destination address of the second data packets (or some of the second data packets) to a destination address of the security server 112 (either remote or local). The security server 112 uses the second data packets for tracking or analysis to identify, for example, the malicious server 1802 or an endpoint IP address of the malicious server 1802.

The example controller 1810 may also perform operations to protect the client device 102 c and/or the application server 1804 from the malicious server 1802. These operations include transmitting a message, alert, etc. to the client device 102 c, the application server 1804, and/or the security processor 112 providing an indication of the malicious server 1802. The controller 1810 may also instruct the switch 1806 to restrict or otherwise block all IP traffic to/from the client device 102 c and/or the malicious server 1802. The controller 1810 may also instruct the application operating the instructions 1822 to close the web browsing application, the webpage 1812, and/or the website. The controller 1810 may also instruct the application operating the instructions 1822 to display a message to a user of the client device 102 c indicating the presence of the malicious server 1802.

As discussed above, the switch 1806 is configured to control the second data packets. Control may include, for example, delaying or queuing the second data packets destined for the client device 102 c. Control may alternatively or additionally include delaying or queuing the second data packets transmitted from the client device 102 c. Control generally may include throttling, delaying, slowing, queuing, or deleting the second data packets. Control may further include changing the contents of the second data packets including the names of labels.
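
As one non-limiting sketch of the delaying or queuing form of control, the following holds each second data packet for a fixed time before releasing it in arrival order. The DelayQueue class, its method names, and the use of caller-supplied timestamps in seconds are assumptions made for illustration; an actual switch would implement queuing in its forwarding plane.

```python
from collections import deque


class DelayQueue:
    """Hold packets for a fixed delay, then release them in arrival order."""

    def __init__(self, delay_s: float):
        self.delay_s = delay_s
        self._held = deque()  # entries are (release_time, packet)

    def enqueue(self, packet, now: float) -> None:
        """Queue a packet that arrived at time `now` (seconds)."""
        self._held.append((now + self.delay_s, packet))

    def release(self, now: float):
        """Return every packet whose hold time has elapsed by `now`."""
        released = []
        while self._held and self._held[0][0] <= now:
            released.append(self._held.popleft()[1])
        return released
```

Because the delay is the same for every packet, release times are ordered the same way as arrival times, so a simple first-in, first-out structure suffices in this sketch.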

In some embodiments, the controller 1810 may specify a delay or queue time for the second data packets and accordingly use this time as a basis for determining a threshold for receiving a signaling packet. For example, the delay may be two minutes and the threshold may also be two minutes. This configuration ensures that it is virtually impossible for the malicious server 1802 to receive a screenshot of the client device 102 c and provide a password back to the client device 102 c within two minutes because all IP traffic (including the screenshot and the password command) between the client device 102 c and the server 1802 along the communication path 1814 is queued separately by the switch 1806 b for two minutes. Under this scenario, the fastest the malicious server 1802 may respond would be four minutes. It should be appreciated that the control delay does not have to exactly equal the signaling packet timing threshold. For instance, there could be a proportional relationship or other relationship based on an algorithmic determination of a fastest time for the malicious server 1802 to respond to the client device 102 c based on a specified queue delay.
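
The timing relationship above can be worked through with a short sketch. It assumes, as in the two-minute example, that traffic in each direction is held once for the full queue delay, so that a remote round trip takes at least twice the delay. The function names and the ratio parameter are hypothetical and only illustrate one possible proportional relationship.

```python
def fastest_remote_response(queue_delay_s: float) -> float:
    """Lower bound on a remote round trip: the outbound screenshot is held
    for the queue delay, and the inbound command is held for it again."""
    return 2 * queue_delay_s


def signaling_threshold(queue_delay_s: float, ratio: float = 1.0) -> float:
    """Threshold proportional to the queue delay (ratio is an assumption).

    Raises ValueError if the threshold would be long enough for a remote
    operator to respond despite the queuing."""
    threshold = ratio * queue_delay_s
    if threshold >= fastest_remote_response(queue_delay_s):
        raise ValueError("threshold must be shorter than the fastest remote response")
    return threshold
```

With a queue delay of 120 seconds and a ratio of 1.0, the threshold is 120 seconds while the fastest remote response is 240 seconds, matching the two-minute and four-minute figures given above.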

FIG. 19 shows a chart 1900 that compares events known in-session from within an application (e.g., a web browsing application) and events known outside the application. In this example, the switch 1806 and/or the controller 1810 are configured to read un-encrypted portions of data packets that include IP addresses, ports, and packet sizes (e.g., data related to events known outside of an application). However, the switch 1806 and/or the controller 1810 do not have access to the content of the data packets (e.g., data related to events known inside an application). Instead, the application operating the instructions 1822 is able to monitor and accordingly report within signaling packets the in-session activity of the application. This includes webpage rendering information, keyboard inputs, mouse inputs, etc.

Flowchart of the Example Process For the Remote Access Embodiment

FIGS. 20 and 21 are a flow diagram showing example procedures 2000, 2030, and 2060 to determine if a malicious server 1802 is remote controlling a client device 102, according to an example embodiment of the present invention. Although the procedures 2000, 2030, and 2060 are described with reference to the flow diagram illustrated in FIGS. 20 and 21, it will be appreciated that many other methods of performing the acts associated with the procedures 2000, 2030, and 2060 may be used. For example, the order of many of the blocks may be changed, certain blocks may be combined with other blocks, and many of the blocks described are optional.

The example procedures 2000, 2030, and 2060 are described in conjunction with a specific banking example. However, it should be appreciated that the example procedure may be used in other contexts and applications. In the banking example, the level of security scrutiny surrounding an Internet connection is lessened if the client endpoint is a previously seen or verified device, with familiar geolocation data, proper account authentication credentials, consistent CPU fingerprints, etc. For these reasons, one popular method of maliciously hacking into a protected network or content provider is to enter through a legitimate or verified endpoint. The malicious individual or attacker first asserts control over the victim's endpoint device (e.g., the client device 102 c) and then at some later time accesses the victim's device from a remote location to perpetrate fraud. From the point of view of the bank server, the connection request physically appears to originate from a legitimate client endpoint. Unseen from the bank server's vantage point is the simultaneous in time connection between the remote command and control endpoint of the malicious device 1802 and the victim's device.

Many network monitoring technologies exist that have visibility over the totality of a client's endpoint connections. Unfortunately, the mere fact that a client endpoint may have several arbitrary connections active at the same time as a sensitive accessing (banking) session is, in and of itself, no proof of a present malicious server. For instance, streaming music, instant messaging, file sharing, and many other applications all have long lived ‘always on’ Internet connections. Many of these connections are also encrypted, which makes any meaningful analysis more difficult.

In the example procedure 2000 of FIG. 20, a security application (e.g., the security processor 112 or specified by instructions 1822) is operating on the client device 102 to provide visibility inside a client browser or other application that is in-session with the banking server (e.g., the in-session activity discussed in connection with FIG. 19). The security application may be included within a website provided by the banking server such that a user of the client device does not have to install an application. Alternatively, the banking server may request for the user to install the security application, or the security application may have already been installed at the client device.

As illustrated in FIG. 20, the client device 102 transmits a request message 2001 to access the banking website (block 2002). A switch 1806 relays the request message 2001 across the network 110 to other switches and routers, such that the request message 2001 is received at the banking server. In response, the banking server transmits HTML code related to the website including HTML code for one or more webpages. The security application on the client device then detects or determines when the HTML code of the webpage from the banking server is about to be rendered into a webpage on a web browser (block 2004). The security application transmits a first signaling packet 2005 that includes an indication of the rendering of the webpage (block 2006). The security application also has visibility into specific activity of the web browser including individual keyboard strokes, mouse movements, etc. It should be appreciated that this level of in-session, detailed application information is not accessible by any network traffic monitoring technology even in the complete absence of encryption because this information is generally not transmitted from the client device. It should also be appreciated that the first signaling packet, and other signaling packets, are addressed to either the switch 1806, the controller 1810, or the banking server. This HTTP traffic is visible within the switch 1806 or the controller 1810 and may include an Ajax request. The signaling packets enable in-session activity and state changes to be passed along for network monitoring devices to discover.

In the illustrated example, the security application detects a username being entered (block 2008) and transmits a second signaling packet 2009 indicative of the username being entered (block 2010). The security application also detects a password being entered (block 2012) and transmits a third signaling packet 2013 indicative of the password being entered (block 2014). It should be appreciated that the security application may detect other in-session activity and transmit corresponding signaling packets.

The example security application may also be configured to determine if action should be taken to protect the client device 102 from the malicious device 1802 (block 2016). For instance, responsive to receiving a message 2017 indicative of the malicious server 1802, the security application may close or suspend the web browsing application or cause a message to be displayed to a user of the client device (block 2018). The security application may also cause a network interface of the client device to suspend communications. After the client device 102 has been protected or the session with the banking server ends, the example procedure 2000 ends.

Procedure 2030 operates on, for example, the network switch 1806. In the illustrated example of FIGS. 20 and 21, the example switch 1806 receives the request message 2001 from the client device 102 and routes the message to the banking server (block 2032). The switch 1806 also routes subsequent first data packets between the banking server and the client device 102 associated with the banking session. The switch 1806 may also determine that the banking server or an IP address of the banking server is included on a pre-specified list (block 2034). Responsive to making the determination, the switch 1806 designates the client device as a protected device (block 2036).
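
A minimal sketch of the designation step of blocks 2034 and 2036 follows. The function name, the use of plain sets, and the example addresses are assumptions for illustration only.

```python
def designate_protected(client_ip, server_ip, protected_servers, protected_clients):
    """Mark the client as protected if the server it contacts is on the
    pre-specified list; return whether the client is now protected."""
    if server_ip in protected_servers:
        protected_clients.add(client_ip)
    return client_ip in protected_clients


# Usage sketch: 192.0.2.10 stands in for the banking server (placeholder address).
protected_servers = {"192.0.2.10"}
protected_clients = set()
designate_protected("10.0.0.5", "192.0.2.10", protected_servers, protected_clients)
```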

The switch 1806 receives the first signaling packet 2005, which is transmitted to the controller 1810 (block 2038). The first signaling packet may be destined for the banking server, in which case the switch routes the signaling packet to the banking website and makes a copy for the controller 1810 or changes the destination address of the signaling packet to the controller 1810. After receiving a control instruction message 2039 from the controller 1810, the switch 1806 determines second data packets associated with the client device (block 2040). As described above, the second data packets are data packets communicated between the client device and servers other than the banking server. It should be appreciated that the determination of the second data packets occurs as long as the client device is designated as a protected device or until the controller 1810 provides an indication otherwise. Such a configuration enables the switch 1806 to signal or provide an indication to the controller 1810 of network-level conditions of the protected endpoint client device 102. The indication may include whether the client device 102 has a single active connection to the banking server, multiple active connections to other servers, etc. The switch 1806 may provide signaling of the network-level condition to the security application on the client device or a remote security processor by modifying HTTP headers of the active banking connection. The signaling may be accomplished through the use of an SDN feature of the switch 1806. In instances where signaling is provided from the switch 1806 to the client device, the security application instead of the controller 1810 may instruct the switch when to throttle the second data packets.
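
The determination of second data packets in block 2040, and the resulting network-level condition, may be sketched as below. Representing each observed flow as a (source, destination) address pair, and the two condition labels, are assumptions made for this sketch.

```python
def second_packet_endpoints(flows, client_ip, provider_ip):
    """Return remote endpoints, other than the content provider, that
    exchange packets with the protected client.

    `flows` is an iterable of (src_ip, dst_ip) pairs seen at the switch."""
    remotes = set()
    for src, dst in flows:
        if src == client_ip and dst != provider_ip:
            remotes.add(dst)
        elif dst == client_ip and src != provider_ip:
            remotes.add(src)
    return remotes


def network_condition(remotes):
    """Summarize whether the client talks only to the content provider."""
    return "single-connection" if not remotes else "multiple-connections"
```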

The switch 1806 also receives the second signaling packet 2009, which is likewise transmitted to the controller 1810 (block 2042). After receiving control message 2043 from the controller 1810, the switch 1806 controls (e.g., throttles) the second packets (block 2044). During or after the control of the second data packets, the switch 1806 receives the third signaling packet 2013, which is transmitted to the controller 1810 (block 2046).

The switch 1806 may determine if normal routing of the second data packets should resume (block 2048). For instance, the switch 1806 may receive control message 2049 from the controller 1810, which instructs the switch to end control of the second data packets (block 2050). Alternatively, the control message 2049 may instruct the switch 1806 to continue to control or modify control of the second data packets (e.g., control only some of the second data packets corresponding to a subset of IP addresses or change a queue time) (block 2052). The switch 1806 continues these actions in blocks 2050 and 2052 until the controller 1810 specifies otherwise, at which point the procedure 2030 ends.

The example procedure 2060 operates on, for example, the controller 1810 of FIG. 18. The procedure 2060 begins when the controller 1810 receives the first signaling packet 2005 (block 2062). The controller 1810 transmits message 2039 instructing the switch to determine second data packets of devices other than the banking server that are communicating with the client device 102 (block 2064). The controller 1810 then determines IP addresses associated with the second data packets for potentially white-listing some devices or as a way to identify a malicious endpoint (block 2066).

Some time later the controller 1810 receives the second signaling packet 2009 (block 2068) and transmits message 2043 instructing the switch 1806 to control the second data packets (block 2070). The controller 1810 then receives the third signaling packet 2013 (block 2072) and determines if the third signaling packet is within specified parameters (block 2074). The specified parameters may include a time period that starts when the second signaling packet is received and/or an action/activity that is complementary or corresponds to an action or activity specified by the second signaling packet 2009. Examples include entering a password after a username, or entering a value after selecting a bill pay object within a webpage. In some instances, the third signaling packet may not be received by the controller 1810, which may strongly indicate the presence of the malicious server 1802.

If the controller 1810 determines the contents of the third signaling packet 2013 are within specified parameters or the third signaling packet 2013 is received within specified parameters, the controller 1810 transmits message 2049 instructing the switch 1806 to resume normal routing of the second data packets (block 2076), at which point the procedure 2060 may end. However, if the controller 1810 determines the third signaling packet 2013 is not within specified parameters, the controller 1810 determines the presence of the malicious server 1802 (block 2078). The controller 1810 may transmit a message 2017 indicating the malicious server 1802 to the client device 102 or the security application (block 2080). The controller 1810 may also perform actions to protect the client device 102 from the malicious server or transmit messages to other entities indicative of the malicious server. At this point the example procedure 2060 may end.
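
The decision of blocks 2074 through 2078 may be sketched as follows. This sketch assumes that both the timing parameter and the complementary-activity parameter are required, whereas the description above permits either or both. The Signal type, the activity labels, and the EXPECTED_NEXT mapping are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Signal:
    activity: str       # e.g. "username_entered" (label is an assumption)
    received_at: float  # seconds, on the controller's clock


# Hypothetical mapping from an activity to its complementary activity.
EXPECTED_NEXT = {
    "username_entered": "password_entered",
    "billpay_selected": "value_entered",
}


def evaluate_third_signal(second: Signal, third: Optional[Signal], threshold_s: float) -> str:
    """Return "resume" to restore normal routing, or "malicious" otherwise."""
    if third is None:
        # No third signaling packet was received at all.
        return "malicious"
    elapsed = third.received_at - second.received_at
    in_time = 0 <= elapsed <= threshold_s
    complementary = EXPECTED_NEXT.get(second.activity) == third.activity
    return "resume" if in_time and complementary else "malicious"
```

A "resume" result corresponds to transmitting message 2049, and a "malicious" result corresponds to determining the presence of the malicious server 1802.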

The controller 1810 is accordingly configured to determine if affecting or throttling out-of-session non-banking traffic has an effect on the in-session activity of the web browsing application. For example, if typing abruptly stops after the second data packets are throttled, the controller 1810 is configured to determine that a command and control channel of the malicious server 1802 is present and providing the keystrokes. It should be appreciated that the procedures 2000, 2030, and/or 2060 may be repeated several times for any given session to reduce the chances of a false positive.
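
One way to use repeated trials to reduce false positives is a simple counting rule, sketched below. The rule itself (requiring a minimum number of trials in which in-session activity stalled while the second data packets were throttled) and the default of two are assumptions for illustration; the description above does not specify how repeated results are combined.

```python
def confirm_remote_control(trial_stalled, min_stalls: int = 2) -> bool:
    """Return True if in-session activity stalled under throttling in at
    least `min_stalls` of the repeated trials.

    `trial_stalled` is an iterable of booleans, one per trial."""
    stalls = sum(1 for stalled in trial_stalled if stalled)
    return stalls >= min_stalls
```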

Conclusion

It will be appreciated that all of the disclosed methods and procedures described herein can be implemented using one or more computer programs or components. These components may be provided as a series of computer instructions on any conventional computer-readable medium, including RAM, ROM, flash memory, magnetic or optical disks, optical memory, or other storage media. The instructions may be configured to be executed by a processor, which when executing the series of computer instructions performs or facilitates the performance of all or part of the disclosed methods and procedures.

It should be understood that various changes and modifications to the example embodiments described herein will be apparent to those skilled in the art. Such changes and modifications can be made without departing from the spirit and scope of the present subject matter and without diminishing its intended advantages. It is therefore intended that such changes and modifications be covered by the appended claims. 

The invention is claimed as follows:
 1. A network security apparatus comprising: a network switch configured to: route first data packets between a client device and a content provider device, designate the client device as a protected device conditioned on determining the content provider device accessed by the client device belongs to a pre-specified set of endpoints, determine at least one of Internet Protocol (“IP”) addresses and ports of other devices that transmit second data packets to or receive second data packets from the protected device, and throttle the second data packets destined for the protected device; and a controller communicatively coupled to the network switch, the controller configured to: receive a first signal packet from the protected device via the network switch indicative that a webpage provided by the content provider device to the client device is to be rendered, receive a second signal packet from the protected device via the network switch indicative that a username is being typed into a username field of the webpage, instruct the network switch to throttle the second data packets after receiving the second signal packet, receive a third signal packet from the protected device via the network switch indicative that a password is being typed into a password field of the webpage, instruct the network switch to resume normal routing of the second data packets conditioned on the third signal packet being received within a specified time period beginning when the second signal packet was received, and responsive to not receiving the third signal packet within the specified time period, provide an indication of a malicious device remotely controlling the protected device.
 2. The apparatus of claim 1, wherein the network switch is to throttle the second data packets by at least one of: throttling the second data packets destined for the protected device, and throttling the second data packets transmitted from the protected device.
 3. The apparatus of claim 1, wherein the controller is configured to use the third signal packet to determine that the user typed the password and the throttled second packets did not include password keystroke commands.
 4. The apparatus of claim 1, wherein throttling includes at least one of delaying, slowing, and queueing the second data packets.
 5. The apparatus of claim 1, wherein the network switch is configured to throttle the second data packets by delaying the routing of the second packets by a specified time duration.
 6. The apparatus of claim 1, wherein the network switch is configured to determine the at least one of the IP addresses and the ports of the other devices after the controller receives the first signal packet.
 7. The apparatus of claim 1, wherein the network switch includes at least one of a network router, a network access point, and a network gateway.
 8. The apparatus of claim 1, wherein the controller includes a software-defined network (“SDN”) controller configured to communicate with the network switch via an application programmable interface (“API”).
 9. The apparatus of claim 1, wherein the first signal packet includes information indicative of an internal state of at least one of the webpage and a browser application operating on the protected device that accesses the webpage.
 10. The apparatus of claim 1, wherein the first signal packet, the second signal packet, and the third signal packet are received by the controller from a security application operating in conjunction with the webpage configured to monitor activity of the webpage.
 11. The apparatus of claim 10, wherein the security application is configured to determine when the username is typed into the browser application and when the password is typed into the browser application.
 12. The apparatus of claim 1, wherein the first signal packet includes information indicative that the webpage is within a final stage of being rendered.
 13. A method to protect a client device from a malicious device comprising: determining, via a controller, the client device is transmitting and receiving first data packets from a content provider device; designating, via the controller, the client device as a protected device conditioned on determining the content provider device accessed by the client device belongs to a pre-specified set of endpoints; receiving, via the controller, a first signal packet from the protected device indicative that a webpage provided by the content provider device to the client device is to be rendered; after receiving the first signal packet, determining, via the controller, at least one of IP addresses and ports of other devices that transmit second data packets to or receive second data packets from the protected device; receiving, via the controller, a second signal packet from the protected device indicative that a user of the protected device has begun to type a username; instructing, via the controller, a network switch to control the second data packets after receiving the second signal packet; determining, via the controller, whether a third signal packet has been received within a specified time period beginning after the second signal packet was received; responsive to determining the third signal packet was not received within the specified time period, determining the malicious device provided the username; responsive to determining the malicious device provided the username, providing, via the controller, an indication the malicious application provided the username; protecting at least one of the protected device and the content provider device from the malicious device; and responsive to determining the third signal packet was received within the specified time period, instructing, via the controller, the network switch to resume normal routing of the second data packets.
 14. The method of claim 13, further comprising transmitting, via the controller, a message to at least one of the content provider device and the protected device including an indication of the malicious device.
 15. The method of claim 13, wherein protecting the at least one of the protected device and the content provider device includes instructing the network switch, via the controller, to block communications with the protected device.
 16. The method of claim 13, wherein protecting the at least one of the protected device and the content provider device includes instructing a security application operating in conjunction with the webpage to close the webpage.
 17. The method of claim 13, wherein the malicious device includes at least one of a darkComet server and a poison ivy server.
 18. The method of claim 13, wherein the specified time period includes a value between ten seconds and one minute.
 19. The method of claim 13, wherein the specified time period is based on the controlling of the second data packets.
 20. The method of claim 19, wherein the specified time period is proportional to a control time of the second data packets.
 21. The method of claim 13, further comprising: determining, via the controller, the IP address of at least one of the other devices is white listed within a data structure of approved endpoints; and instructing, via the controller, the network switch to resume normal routing of the second data packets associated with the determined IP address.
 22. The method of claim 13, further comprising: modifying, via the controller, a header field of the second data packets to transmit the second data packets to a third-party security server for tracking or analysis.
 23. A machine-accessible device having instructions stored thereon that, when executed, cause a machine to at least: detect within a web browsing application operating on a client device that a webpage from a content provider server is being rendered; transmit a first signal packet to a controller via a network switch indicative that the webpage is in the process of being rendered; detect a user of the client device has begun to type first information into a first field of the webpage; transmit a second signal packet to the controller indicative that the user has begun to type the first information; detect the user of the client device has begun to type second information into a second field of the webpage; transmit a third signal packet to the controller indicative that the user has begun to type the second information; and responsive to receiving a protection message from the controller, instruct the web browsing application to suspend operation.
 24. The machine-accessible device of claim 23, wherein the content provider server includes at least one of an application server or a database server.
 25. The machine-accessible device of claim 23, wherein the first information includes a username and the second information includes a password.
 26. The machine-accessible device of claim 23, having instructions stored thereon that, when executed, cause the machine to at least: determine a second webpage linked to the first webpage from the content provider server is being rendered; transmit a second first signal packet to the controller indicative that the second webpage is in the process of being rendered; determine the user of the client device has selected an object located on the second webpage; transmit a second signal packet to the controller indicative that the user has selected the object; determine the user of the client device has begun to type information into a field of the second webpage associated with the object; transmit a third signal packet to the controller indicative that the user has begun to type the information; and instruct the web browsing application to suspend operation responsive to receiving a protection message from the controller. 